欧洲专利EP3709600A1 Load balancing

专利PDF首页>>欧洲专利

专利附录

专利说明

权利要求

类似技术

同族专利

引用文献

法律状态

优先权

专利摘要:
Some examples provide a novel method for load balancing data messages that are sent by a source compute node (SCN) to one or more different groups of destination compute nodes (DCNs). In some examples, the method deploys a load balancer in the source compute node's egress datapath. This load balancer receives each data message sent from the source compute node, and determines whether the data message is addressed to one of the DCN groups for which the load balancer spreads the data traffic to balance the load across (e.g., data traffic directed to) the DCNs in the group. When the received data message is not addressed to one of the load balanced DCN groups, the load balancer forwards the received data message to its addressed destination. On the other hand, when the received data message is addressed to one of load balancer's DCN groups, the load balancer identifies a DCN in the addressed DCN group that should receive the data message, and directs the data message to the identified DCN. To direct the data message to the identified DCN, the load balancer in some examples changes the destination address (e.g., the destination IP address, destination port, destination MAC address, etc.) in the data message from the address of the identified DCN group to the address (e.g., the destination IP address) of the identified DCN.
公开号:EP3709600A1
申请号:EP20172139.6
申请日:2014-12-30
公开日:2020-09-16
发明作者:Jayant Jain；Anirban Sengupta；Mohan Parthasarathy；Allwyn Sequeira；Serge Maskalik；Rick Lund
申请人:Nicira Inc；
IPC主号:H04L47-00

专利说明:
[0001] Load balancers are commonly used in datacenters to spread the traffic load to a number of available computing resources that can handle a particular type of traffic. Figures 1 and 2 illustrate two common deployments of load balancers in datacenters today. In Figure 1 , the load balancers 100 are topologically deployed at the edge of the network and between different types of VMs (e.g., between webservers 105 and application servers 110, and between application servers 110 and the database servers 115). The load balancers 100 are in some deployments standalone machines (e.g., F5 machines) that perform load balancing functions. Also, in some deployments, the load balancers are service virtual machines (VMs) that are executing on the same host computing devices that execute the different layers of servers that have their traffic balanced by the load balancers. Figure 2 illustrates one such deployment of load balancers as service VMs (SVMs).
[0002] In the load balancer deployments of Figures 1 and 2 , the load balancers serve as chokepoint locations in the network topology because they become network traffic bottlenecks as the traffic load increases. Also, these deployments require manual configuration of the load balancers and the computing devices that send data packets to these load balancers in order to allow the load balancers to properly receive and distribute the load balanced traffic. These deployments also do not seamlessly grow and shrink the number of the computing devices that receive the load balanced traffic, as the data traffic increases and decreases.
[0003] US 2005/0249199 Albert et al describes load balancing using distributed forwarding agents with application based feedback for different virtual machines. US 2009/0327464 Archer et al describes load balanced data processing performed on an application message transmitted between compute nodes. US 2005/0114429 Caccavale describes load balancing of distributed processing units based on performance metrics. US 2011/0276695 A1 Maldaner describes continuous upgrading of computers in a load balanced environment which receives incoming requests from client computers across a network. US 2010/0281282 Pike et al describes an application efficiency engine in an operating environment but may include one or more load balances. BRIEF SUMMARY
[0004] The present invention is defined according to the independent claims. Additional features will be appreciated from the dependent claims and the description herein. Any embodiments which are described but which do not fall within the scope of the claims are to be interpreted merely as examples useful for a better understanding of the invention..
[0005] By way of introduction, some examples provide a novel method for load balancing data messages that are sent by a source compute node (SCN) to one or more different groups of destination compute nodes (DCNs). In some examples, the method deploys a load balancer in the source compute node's egress datapath. This load balancer receives each data message sent from the source compute node, and determines whether the data message is addressed to one of the DCN groups for which the load balancer spreads the data traffic to balance the load across (e.g., data traffic directed to) the DCNs in the group. When the received data message is not addressed to one of the load balanced DCN groups, the load balancer forwards the received data message to its addressed destination. On the other hand, when the received data message is addressed to one of load balancer's DCN groups, the load balancer identifies a DCN in the addressed DCN group that should receive the data message, and directs the data message to the identified DCN. To direct the data message to the identified DCN, the load balancer in some examples changes the destination address (e.g., the destination IP address, destination port, destination MAC address, etc.) in the data message from the address of the identified DCN group to the address (e.g., the destination IP address) of the identified DCN.
[0006] By employing this inline load-balancing (LB) method, a source compute node does not have to be configured to address certain data messages to load balancers while foregoing such addressing for other data messages. This method can also seamlessly perform load balancing for several different DCN groups. In some examples, the source compute node and DCN group(s) are within one compute cluster in a datacenter. Accordingly, the method of some examples can seamlessly load balance data messages that are sent to one or more DCN groups within a compute cluster from source compute nodes in the compute cluster.
[0007] In some examples, the source compute node is a virtual machine (VM) that executes on a host, and the load balancer is another software module that executes on the same host. Other VMs also execute on the host in some examples. Two or more of the VMs (e.g., all of the VMs) on the host use the same load balancer in some examples, while in other examples, each VM on the host has its own load balancer that executes on the host.
[0008] The host also executes a software forwarding element (SFE) in some examples. The SFE communicatively couples the VMs of the host to each other and to other devices (e.g., other VMs) outside of the host. In some examples, the load balancers are inserted in the egress path of the VMs before the SFE. For instance, in some examples, each VM has a virtual network interface card (VNIC) that connects to a port of the SFE. In some of these examples, the load balancer for a VM is called by the VM's VNIC or by the SFE port to which the VM's VNIC connects. In some examples, the VMs execute on top of a hypervisor, which is a software layer that enables the virtualization of the shared hardware resources of the host. In some of these examples, the hypervisor provides the load balancers that provide the inline load balancing service to its VMs.
[0009] The load balancing method of some examples is implemented in a datacenter that has several hosts executing several VMs and load balancers. In some of these examples, some or all of the load balanced DCNs are other VMs that are executing on the same or different hosts as the SCN VMs. Examples of source and destination compute nodes that can be load balanced by the load balancing method of some examples include data compute end nodes (i.e., source and data compute end nodes) that generate or consume data messages, or middlebox service nodes that perform some type of data processing on the data messages as these messages are being relayed between the data compute end nodes. Examples of data compute end nodes (DCEN) include webservers, application servers, database servers, etc., while example of middlebox service nodes include firewalls, intrusion detection systems, intrusion prevention systems, etc.
[0010] In a multi-host environment of some examples, the load balancers on the host implement a distributed load balancing (DLB) method. This DLB method of some examples involves deploying one or more load balancers on the hosts that execute the SCN VMs. The load balancers on the hosts enforce the load balancing rules needed to spread the data traffic from the SCN VMs on their hosts to the DCNs of one or more DCN groups. In this distributed implementation, each load balancer enforces just the load balancing rules that are applicable to its SCN VM or VMs.
[0011] A set of one or more controllers facilitate the DLB operations of some examples. For instance, in some examples, the load balancers on the hosts collect data traffic statistics based on the data messages that they load balance. These load balancers then pass the collected statistics to the controller set, which aggregates the statistics. In some examples, the controller set then distributes the aggregated statistics to load balancing agents that execute on the hosts. These agents then analyze the aggregated statistics to generate and/or to adjust load balancing criteria that the load balancers (that execute on the same hosts as the agents) enforce. In other examples, the controller set analyzes the aggregated statistics to generate and/or to adjust load balancing criteria, which the controller set then distributes to the hosts for their load balancers to enforce. In still other examples, the controller set generates and distributes some load balancing criteria based on the aggregated statistics, while also distributing some or all aggregated statistics to the hosts so that their LB agents can generate other load balancing criteria.
[0012] Irrespective of the implementation for generating the load balancing criteria, the collection and aggregation of the data traffic statistics allows the load balancing criteria to be dynamically adjusted. For instance, when the statistics show that one DCN is too congested with data traffic, the load balancing criteria can be adjusted dynamically to reduce the load on this DCN while increasing the load on one or more DCNs in the same DCN group. In some examples, the collection and aggregation of the data traffic statistics also allows the DLB method to reduce the load in any load balanced DCN group by dynamically instantiating or allocating new DCN VMs for the DCN group or by instantiating or allocating new SCN VMs.
[0013] The preceding Summary is intended to serve as a brief introduction to some examples of the invention. It is not meant to be an introduction or overview of all inventive subject matter disclosed in this document. The Detailed Description that follows and the Drawings that are referred to in the Detailed Description will further describe the examples described in the Summary as well as other examples. Accordingly, to understand all the examples described by this document, a full review of the Summary, Detailed Description, the Drawings and the Claims is needed. Moreover, the claimed subject matters are not to be limited by the illustrative details in the Summary, Detailed Description and the Drawing. BRIEF DESCRIPTION OF THE DRAWINGS
[0014] The novel features of the invention are set forth in the appended claims. However, for purposes of explanation, several examples of the invention are set forth in the following figures. Figures 1 and 2 illustrate two common deployments of load balancers in datacenters today. Figure 3 illustrates a load balancing architecture that employs the inline load-balancing method of some examples. Figure 4 illustrates an example of inline load balancers. Figure 5 illustrates an example of a controller set that gathers statistics from hosts and based on the gathered statistics, dynamically adjusts the load balancing operations. Figure 6 illustrates a more detailed architecture of a host that executes the load balancing modules of some examples of the invention. Figures 7 and 8 presents examples of load balancing rules of some examples. Figure 9 illustrates a process that a load balancer performs in some examples. Figures 10 and 11 illustrate two processes that a load balancing agent performs in some examples. Figure 12 illustrates a process that a controller set performs in some examples. Figure 13 illustrates a process that shows the operation of the controller set for examples in which the controller set analyzes the membership updates and/or global statistics, and in response to this analysis specifies and/or updates LB rules if needed. Figures 14-16 present several examples that illustrate how some examples dynamically adjust the spreading of traffic by adjusting the load balancing criteria and by adding/removing DCN VMs. Figure 17 illustrates that the distributed load balancing architecture of some examples can be used to load balance the data traffic to and from middleboxes. Figure 18 presents an example that illustrates one VM's inline load balancer forming multiple distributed load balancers with multiple other inline load balancers of other VMs. Figure 19 presents an example to illustrate that the distributed load balancers of some examples can differently translate the virtual addresses of data messages to different groups of DCNs. Figure 20 illustrates a set of distributed load balancers that direct webserver data messages to a group of application servers to either a high-priority sub-group of application servers or a low priority group of application servers based on the assessed priority of the data messages. Figure 21 conceptually illustrates a computer system with which some examples of the invention are implemented. DETAILED DESCRIPTION
[0015] In the following detailed description of the invention, numerous details, examples, and examples of the invention are set forth and described. However, it will be clear and apparent to one skilled in the art that the invention is not limited to the examples set forth and that the invention may be practiced without some of the specific details and examples discussed.
[0016] Some examples provide a novel method for load balancing data messages that are sent by a source compute node (SCN) to one or more different groups of destination compute nodes (DCNs). In some examples, the method deploys a load balancer in the source compute node's egress datapath. This load balancer receives each data message sent from the source compute node, and determines whether the data message is addressed to one of the DCN groups for which the load balancer spreads the data traffic to balance the load across (e.g., data traffic directed to) the DCNs in the group. When the received data message is not addressed to one of the load balanced DCN groups, the load balancer forwards the received data message to its addressed destination. On the other hand, when the received data message is addressed to one of load balancer's DCN groups, the load balancer identifies a DCN in the addressed DCN group that should receive the data message, and directs the data message to the identified DCN. To direct the data message to the identified DCN, the load balancer in some examples changes the destination address (e.g., the destination IP address) in the data message from the address of the identified DCN group to the address (e.g., the destination IP address, destination port, destination MAC address, etc.) of the identified DCN.
[0017] Examples of source and destination compute nodes that can be load balanced by the method of some examples include data compute end nodes (i.e., source and data compute end nodes) that generate or consume data messages, or middlebox service nodes that perform some data processing on the data messages that are relayed between the data compute end nodes. Examples of data compute end nodes (DCEN) include webservers, application servers, database servers, etc., while example of middlebox service nodes include firewalls, intrusion detection systems, intrusion protection systems, etc. Also, as used in this document, a data message refers to a collection of bits in a particular format sent across a network. One of ordinary skill in the art will recognize that the term data message may be used herein to refer to various formatted collections of bits that may be sent across a network, such as Ethernet frames, IP packets, TCP segments, UDP datagrams, etc.
[0018] By employing the inline load-balancing (LB) method of some examples, a source compute node does not have to be configured to address certain data messages to load balancers while foregoing such addressing for other data messages. In some examples, the service gets deployed for a SCN automatically when the SCN is deployed as a virtual machine on a host, and the VM deployment process configures the load balancing criteria for the VM. This method can also seamlessly perform load balancing for several different DCN groups. In some examples, the SCNs and the DCNs are within one compute cluster in a datacenter. Accordingly, the method of some examples can seamlessly load balance data messages that are sent to one or more DCN groups within a compute cluster from other source compute nodes in the compute cluster.
[0019] Figure 3 illustrates a load balancing architecture 300 that employs the inline load-balancing method of some examples. This architecture is a distributed load balancing (DLB) architecture that has a load balancer 305 in the egress datapath of each of several compute nodes. The compute nodes in this example fall into three groups of servers, which are web servers 310, application servers 315, and database servers 320. In some examples, the three groups of servers are three-tiers of servers that are commonly found in adatacenter.
[0020] As shown, a load balancer 305 is placed at the output of each web or application server in this example, so that webserver data traffic to the application servers is load balanced, and the application server data traffic to the database servers is load balanced. Each load balancer enforces the load balancing rules needed to spread the data traffic that is sent from the load balancer's corresponding source compute node (e.g., source servers) to multiple destination compute nodes (e.g., destination servers) that are part of one DCN group. In other words, this distributed implementation allows each load balancer to enforce just the load balancing rules that are applicable to its source compute node. Also, this distributed architecture does not have any load balancer that is a chokepoint as it receives too much data messages from one or more source compute nodes that prevent it from timely spreading the data messages from another source compute node.
[0021] In some examples, some or all of the source and destination compute nodes are virtual machines (VMs) that executes on a host, and some or all of the load balancers are other software module that executes on the same hosts as their source compute nodes. Figure 4 illustrates an example in which the load balancers 305 and the three groups of servers 310, 315, and 320 of Figure 3 are executing on six hosts 405-430 in a datacenter. In the example illustrated in Figure 4 , one LB executes on each host for each web or application server that needs some of its data messages load balanced. In other examples, however, one load balancer on a host load balances the output data messages of two or more of the VMs (e.g., all of the VMs) on the host. Even under this architecture that uses one load balancer for two or more SCN VMs, the load balancers implement a DLB scheme as each load balancer enforces just the load balancing rules that are applicable to SCN VM or VMs on that host.
[0022] Figure 4 illustrates that in addition to the VMs and load balancers that execute on the hosts, each host also executes a software forwarding element (SFE) 435 in some examples. The SFE 435 on a host communicatively couples the VMs of the host to each other and to other devices outside of the host (e.g., VMs on other hosts) through one or more other forwarding elements (e.g., one or more switches and routers) outside of the host. Examples of SFEs include software switches, software routers, etc.
[0023] As shown in Figure 4 , the load balancers in some examples are inserted in the egress path of the VMs before the SFE. For instance, in some examples, each VM has a virtual network interface card (VNIC) that connects to a port of the SFE. In some of these examples, the load balancer for a VM is called by the VNIC of the VM or by the SFE port to which the VM's VNIC connects. In some examples, the VMs execute on top of a hypervisor, which is a software layer that enables the virtualization of the shared hardware resources of the host. In some of these examples, the hypervisors provide the load balancers that provide the inline load balancing service to its VMs.
[0024] Figure 4 also shows each host to have two data storages 440 and 445. The first data storage is an LB rule data storage 440 (e.g., database), while the second data storage is a STAT data storage 445. In some examples, the host's data storage 440 stores LB rules that specify the IP addresses of the DCN VMs of the DCN groups that are load balanced by the host's load balancers. In some examples, the LB rule storages 440 not only store the IP addresses of the DCN VMs but also stores the load balancing criteria (metrics) that the load balancers use to load balance the data traffic. While one LB rule storage 440 is shown for all load balancers 305 in Figure 4 , one of ordinary skill in the art will realize that in other examples each load balancer 305 has its own LB rule storage 440.
[0025] In some examples, a SCN VM sends a data message to a virtual address (e.g., a virtual IP (VIP) address) that is associated with a load balanced DCN group. Before this data message is processed by the SFE of the VM's host, the SCN VM's load balancer intercepts the data message and determines that it is addressed to a DCN group (e.g., determines that the message's destination IP address is the VIP of a DCN group) whose input data should be load balanced by the load balancer. The load balancer then replaces the virtual address in the data message with a DCN VM's physical address (e.g., the VM's IP address) that is stored in the LB rule storage 440. The changing of the destination virtual address to a DCN VM's physical address is a form of destination network address translation. As the virtual address is replaced by a physical address, the virtual address does not have to be routed out of the host, which simplifies the deployment of the load balancing scheme.
[0026] In selecting the DCN VM that should receive the data message, the load balancer in some examples uses the load balancing criteria that is stored in the LB rule storage 440. After changing the network address of the received data message, the load balancer supplies the data message to the SFE for it to process so that the data message can reach the addressed DCN VM. One intrinsic advantage of this approach is that no source address translation (e.g., source NAT) is required because the traffic comes back to the SCN VM that generated the traffic.
[0027] The STAT data storage 445 stores statistics regarding the load balanced data messages. For instance, as the load balancers 305 spread the data messages to one or more load balanced DCN groups, the load balancers in some examples store statistics about how many data messages and/or how many data flows are being sent to each DCN in each load balanced DCN group. In other examples, the load balancers store other statistics, as further described below. While one STAT data storage 445 is shown for all load balancers 305 in Figure 4 , one of ordinary skill in the art will realize that in other examples each load balancer 305 has its own STAT data storage 445.
[0028] In some examples, the statistics that are stored in the STAT data storage 445 on each host are passed to a set of one or more LB controllers that facilitate the DLB operations of some examples. The controller set then aggregates the statistics that it receives from each host. The controller set then (1) distributes the aggregated statistics to each host so that each host can define and/or adjust its load balancing criteria, and/or (2) analyzes the aggregated statistics to specify and distribute some or all of the load balancing criteria for the load balancers to enforce. In this manner, the load balancing criteria can be dynamically adjusted based on the statistics that are stored in the STAT data storage 445.
[0029] In some examples, the controller set also dynamically instantiates or allocates VMs to SCN or DCN groups in order to reduce the load in any load balanced DCN group. The controller set can also dynamically instantiate or allocate VMs to SCN or DCN groups when it detects that a VM in one of these groups has crashed or has other operational issues. In such circumstances, the load balancing operations of the distributed load balancers can be adjusted in order to use the newly instantiated or allocated VM, and to reduce or eliminate the use of the VM that has crashed or has operational issues.
[0030] Figure 5 illustrates an example of a controller set that gathers statistics from hosts and based on the gathered statistics, dynamically adjusts the load balancing operations. Specifically, this figure illustrates a multi-host system 500 of some examples. As shown, this system includes multiple virtualized hosts 505-515, a set of load balancing (LB) controllers 520, and a set of one or more VM managing controllers 525. As shown in Figure 5 , the hosts 505-515, the LB controller set 520, and the VM manager set 525 communicatively couple through a network 575, which can include a local area network (LAN), a wide area network (WAN) or a network of networks (e.g., Internet).
[0031] The VM managing controllers 525 provide control and management functionality for defining (e.g., allocating or instantiating) and managing one or more VMs on each host. These controllers in some examples also provide control and management functionality for defining and managing multiple logical networks that are defined on the common software forwarding elements of the hosts. In some examples, the hosts 505-515 are similar to the hosts 405-430 of Figure 4 , except that the hosts 505-515 each are shown to include an LB agent 560 for interacting with the LB controller set 520, while not showing the other components of the hosts, such as LB and STAT data storages 440 and 445. The LB agents 560 gather the collected statistics from the STAT data storage 445, and relay these statistics to the LB controller set 520. In some examples, the LB agents 560 aggregate and/or analyze some of the statistics before relaying processed statistics to the LB controller set, while in other examples the LB agents relay collected raw statistics to the LB controller set.
[0032] The LB controller set 520 aggregates the statistics that it receives from the LB agents of the hosts. In some examples, the LB controller set 520 then distributes the aggregated statistics to the LB agents that execute on the hosts. These agents then analyze the aggregated statistics to generate and/or to adjust LB rules or criteria that the load balancers that execute on the same hosts as the agents enforce.
[0033] In other examples, the controller set analyzes the aggregated statistics to generate and/or to adjust LB rules or criteria, which the controller set then distributes to the hosts for their load balancers to enforce. In some of these examples, the controller set distributes the same LB rules and/or criteria to each load balancer in a group of associated load balancers (i.e., in a group of load balancers that distribute the data messages amongst the DCNs of a group of DCNs), while in other examples, the controller distributes different LB rules and/or criteria to different load balancers in the group of associated load balancers. Also, in some examples, the controller set distributes updated LB rules and/or criteria to some of the load balancers in an associated group of load balancers, while not distributing the updated LB rules and/or criteria to other load balancers in the associated group.
[0034] In still other examples, the controller set generates and distributes some load balancing rules or criteria based on the aggregated statistics, while also distributing some or all aggregated statistics to the hosts so that their LB agents can generate other load balancing rules or criteria. One of ordinary skill in the art will realize that the LB rules and/or criteria are not always adjusted based on the aggregated statistics. Rather the LB rules and/or criteria are modified only when the aggregated statistics require such modification.
[0035] Irrespective of the implementation for generating the LB rules, the collection and aggregation of the data traffic statistics allows the LB rules or criteria to be dynamically adjusted. For instance, when the statistics show one DCN as being too congested with data traffic, the LB rules or criteria can be adjusted dynamically for the load balancers of the SCNs that send data messages to this DCN's group, in order to reduce the load on this DCN while increasing the load on one or more other DCNs in the same DCN group. In some examples, the collection and aggregation of the data traffic statistics also allows the LB controller set 520 to reduce the load on any DCN in a load balanced DCN group by dynamically directing the VM managing controller set 525 to instantiate or allocate new DCN VMs for the DCN group or by instantiating or allocating new SCN VMs.
[0036] Figure 6 illustrates a more detailed architecture of a host 600 that executes the load balancing modules of some examples of the invention. As shown, the host 600 executes multiple VMs 605, an SFE 610, a set of one or more load balancers 615, an LB agent 620, and a publisher 622. The host also has LB rule storage 440 and the STAT data storage 445, as well as group membership data storage 684, policy data storage 682, aggregated (global) statistics data storage 686, and connection state storage 690.
[0037] The SFE 610 executes on the host to communicatively couple the VMs of the host to each other and to other devices outside of the host (e.g., other VMs on other hosts) through one or more forwarding elements (e.g., switches and/or routers) that operate outside of the host. As shown, the SFE 610 includes a port 630 to connect to a physical network interface card (not shown) of the host, and a port 635 to connect to the VNIC 625 of each VM. In some examples, the VNICs are software abstractions of the physical network interface card (PNIC) that are implemented by the virtualization software (e.g., by a hypervisor). Each VNIC is responsible for exchanging data messages between its VM and the SFE 610 through its corresponding SFE port. As shown, a VM's egress datapath for its data messages includes (1) the VM's VNIC 625, (2) the SFE port 635 that connects to this VNIC, (3) the SFE 610, and (4) the SFE port 630 that connects to the host's PNIC.
[0038] Through its port 630 and a NIC driver (not shown), the SFE 610 connects to the host's PNIC to send outgoing packets and to receive incoming packets. The SFE 610 performs message-processing operations to forward messages that it receives on one of its ports to another one of its ports. For example, in some examples, the SFE tries to use header values in the VM data message to match the message to flow based rules, and upon finding a match, to perform the action specified by the matching rule (e.g., to hand the packet to one of its ports 630 or 635, which directs the packet to be supplied to a destination VM or to the PNIC). In some examples, the SFE extracts from a data message a virtual network identifier and a MAC address. The SFE in these examples uses the extracted VNI to identify a logical port group, and then uses the MAC address to identify a port within the port group. In some examples, the SFE 610 is a software switch, while in other examples it is a software router or a combined software switch/router.
[0039] The SFE 610 in some examples implements one or more logical forwarding elements (e.g., logical switches or logical routers) with SFEs executing on other hosts in a multi-host environment. A logical forwarding element in some examples can span multiple hosts to connect VMs that execute on different hosts but belong to one logical network. In other words, different logical forwarding elements can be defined to specify different logical networks for different users, and each logical forwarding element can be defined by multiple SFEs on multiple hosts. Each logical forwarding element isolates the traffic of the VMs of one logical network from the VMs of another logical network that is serviced by another logical forwarding element. A logical forwarding element can connect VMs executing on the same host and/or different hosts.
[0040] The SFE ports 635 in some examples include one or more function calls to one or more modules that implement special input/output (I/O) operations on incoming and outgoing packets that are received at the ports. One of these function calls for a port is to a load balancer in the load balancer set 615. In some examples, the load balancer performs the load balancing operations on outgoing data messages that are addressed to DCN groups whose input traffic is being spread among the DCNs in the group in order to reduce the load on any one DCN. For the examples illustrated by Figure 6 , each port 635 has its own load balancer 615. In other examples, some or all of the ports 635 share the same load balancer 615 (e.g., all the ports share one load balancer, or all ports that are part of the same logical network share one load balancer).
[0041] Examples of other I/O operations that are implemented by the ports 635 include firewall operations, encryption operations, message encapsulation operations (e.g., encapsulation operations needed for sending messages along tunnels to implement overlay logical network operations), etc. By implementing a stack of such function calls, the ports can implement a chain of I/O operations on incoming and/or outgoing messages in some examples. Instead of calling the I/O operators (including the load balancer set 615) from the ports 635, other examples call these operators from the VM's VNIC or from the port 630 of the SFE.
[0042] The load balancers 615 perform their load balancing operations based on the LB rules that are specified in the LB rule storage 440. For a virtual address (e.g., VIP) of a load balanced DCN group, the LB rule storage 440 stores a load balancing rule that specifies two or more physical addresses (e.g., IP addresses) of DCNs of the group to which a data message can be directed. In some examples, this load balancing rule also includes load balancing criteria for specifying how the load balancer should spread the traffic across the DCNs of the group associated with a virtual address.
[0043] One example of such load balancing criteria is illustrated in Figure 7 , which presents examples of load balancing rules that are stored in the LB rule storage 440. As shown, this data storage includes multiple LB rules 700, with each LB rule associated with one load balanced DCN group. In this example, each load balance rule includes (1) a set of data-message identifying tuples 705, (2) several IP addresses 710 of several DCNs of the load balanced DCN group, and (3) a weight value 715 for each IP address.
[0044] Each rule's tuple set 705 includes the VIP address (as the destination IP address) of the rule's associated DCN group. In some examples, the tuple set 705 only includes the VIP address. In other examples, the tuple set also includes other data message identifiers, such as source IP address, source port, destination port, and protocol, which together with the destination IP address form the five-tuple header values. In some examples, a load balancer searches a LB data storage by comparing one or more message identifier values (e.g., the destination IP address, or one or more of the five-tuple header values) to the rule tuple sets 705 to identify a rule that has a tuple set that matches the message identifier values.
[0045] Each LB rule's IP addresses 710 are the IP addresses of the DCNs that are members of the DCN group that has the VIP address specified in the rule's tuple set 705. In some examples, the addresses of the DCNs are supplied as a part of the data initially supplied by the controller set (e.g., in order to configure the load balancer) or are supplied in subsequent updates to the DCN group information that is provided by the controller set.
[0046] The weight values 715 for the IP addresses of each LB rule provides the criteria for a load balancer to spread the traffic to the DCNs that are identified by the IP addresses. For instance, in some examples, the load balancers use a weighted round robin scheme to spread the traffic to the DCNs of the load balanced DCN group. As one example, assume that the DCN group has five DCNs and the weight values for the IP addresses of these DCNs are 1, 3, 1, 3, and 2. Based on these values, a load balancer would distribute data messages that are part of ten new flows as follows: 1 to the first IP address, 3 to the second IP address, 1 to the third IP address, 3 to the fourth IP address, and 2 to the fifth IP address.
[0047] As further described below, the weight values for an LB rule are generated and adjusted by the LB agent 620 and/or LB controller set in some examples based on the LB statistics that the load balancers store in the STAT data storage 445. To gracefully switch between different load balancing criteria, the LB rules in some examples specify time periods for different load balancing criteria of a LB rule that are valid for different periods of time.
[0048] Figure 8 illustrates an example of load balancing rules 800 with such time period parameters. These LB rules are stored in the LB rule storage 440 in some examples. Each LB rule 800 has one message identifying tuple 805, one or more IP address sets 810, and one or more weight value sets 815. Each IP address set 810 has two or more IP addresses, and each weight value set 815 is associated with an IP address set and has one weight value for each IP address in its associated IP address set.
[0049] In the example illustrated in Figure 8 , each rule has multiple sets of IP addresses and multiple sets of weight values. Each set of IP addresses and its associated set of weight values represents one set of load balancing criteria. For each of these sets of load balancing criteria, each rule has a time value 820 that specifies the time period during which the IP address set 810 and its associated weight value set 815 are valid. For instance, in a LB rule, the time value for one IP address set might specify "before 1 pm on 9/1/2014," while the time value for another IP address set might specify "after 12:59 pm on 9/1/2014." These two time periods allow the load balancers to seamlessly switch from using one IP address set and its associated weight value set to another IP address set and its associated weight value set at 1 pm on 9/1/2014. These two IP address sets might be identical and they might only differ in their associated weight value sets. Alternatively, the two IP address sets might be different. Two IP address sets might differ but have overlapping IP addresses (e.g., one set might have five IP addresses, while another set might have four of these five IP addresses when one DCN is added or removed from a DCN group). Alternatively, two IP address sets might differ by having no IP addresses in common.
[0050] In Figure 8 , the time period values and the weight values are used in the LB rules. One of ordinary skill in the art will realize that in other examples, the LB rules do include the weight values, but include the time values to allow the load balancer to gracefully switch between different sets of load balanced DCNs. As before, two DCN sets may differ by having mutually exclusive DCNs, or they may differ by having one or more DCNs in common and one or more DCNs not in common.
[0051] As shown in Figure 6 , the host also includes a connection state storage 690 in which the load balancer stores data records that allow the load balancer to maintain connection state for data messages that are part of the same flow, and thereby to distribute data messages that are part of the same flow statefully to the same DCN. More specifically, whenever a load balancer identifies a DCN for a data message based on the message's group destination address (e.g., the destination VIP), the load balancer not only replaces the group destination address with the DCN's address (e.g., with the DCN IP address), but also stores a record in the connection state storage 690 to identify the DCN for subsequent data messages that are part of the same flow. This record stores the destination IP address of the identified DCN along with the data message's header values (e.g., the five tuple values). In some examples, for fast access, the connection data storage 690 is hash indexed based on the hash of the data message header values.
[0052] To identify a DCN for a received data message, the load balancer first checks the connection state storage 690 to determine whether it has previously identified a DCN for receiving data messages that are in the same flow as the received message. If so, the load balancer uses the DCN that is identified in the connection state storage. Only when the load balancer does not find a connection record in the connection state storage 690, the load balancer in some examples examines the LB rules in the LB rule storage 440 in order to identify a DCN to receive the data message.
[0053] By searching the connection state storage 690 with the message identifiers of subsequent data messages that are part of the same flow, the load balancer can identify the DCN that it previously identified for a data message of the same flow, in order to use the same DCN for the messages that are part of the same flow (i.e., in order to statefully perform its load balancing operation). In some examples, the load balancer also uses the connection state storage 690 records to replace the DCN's destination address with the virtual group address (e.g., the group VIP address) on the reverse flow path when the load balancer receives (from the SFE port 630 or 635) data messages sent by the DCN to the SCN. After translating of the destination addresses of a data message in the reverse flow, the load balancer returns the data message to the SFE port that called it, so that the SFE port can direct the data message to SCN VM.
[0054] In some examples, the connection state storage 690 is addressed differently than the LB data storage 440. For instance, as mentioned above, the connection state storage 690 in some examples stores its connection-state records based on hashed message identifier values (e.g., five tuple identifier values), while not using such a hash addressing scheme for the LB rule data storage 440. In some examples, the hashed values specify memory locations in the connection state storage 690 that store the corresponding message-identifier sets. Because of this addressing scheme, the load balancer generates a hash of the message-identifier set to identify one or more locations in the connection state storage 690 to examine for a matching message-identifier set. In other examples, the LB rule data storage 440 is also hash indexed based on the hash of the tuple set 705.
[0055] In Figure 6 , only one LB rule data storage 440 and only one connection state storage 690 are illustrated for all the load balancers 615. In other examples, each load balancer has its own rule data storage 440 and connection state storage 690. In yet other examples, the host has several rule data storages 440 and connection state storages 690, but two or more load balancers can share a rule data storage or connection state storage (e.g., two load balancers that are balancing the load for two VMs that are part of the same logical network). As further described below by reference to Figure 18, each load balancer 615 having its own rule data storage 440 and connection state storage 690 allows these storages to be smaller and easier to search more quickly.
[0056] In some examples, each time a load balancer 615 performs a load balancing operation on a data message (i.e., replaces the destination virtual address of the message to a destination address of a DCN), the load balancer updates the statistics that it maintains in the STAT data storage 445 for the data traffic that it relays to the DCN that was addressed as part of its load balancing operation. Several examples of statistics were provided above and will be further described below.
[0057] In some examples, the LB agent 620 gathers (e.g., periodically collects) the statistics that the load balancers store in the STAT data storage(s) 445, and relays these statistics to the LB controller set 520. Based on statistics that the LB controller set 520 gathers from various LB agents of various hosts, the LB controller set (1) distributes the aggregated statistics to each host's LB agent so that each LB agent can define and/or adjust its load balancing criteria, and/or (2) analyzes the aggregated statistics to specify and distribute some or all of the load balancing criteria for the load balancers to enforce.
[0058] In some examples where the LB agent receives new load balancing criteria from the LB controller set, the LB agent stores these criteria in the host-level LB rule storage 688 for propagation to the LB rule storage(s) 440. In the example where the LB agent receives aggregated statistics from the LB controller set, the LB agent stores the aggregated statistics in the global statistics data storage 686. In some examples, the LB agent 620 analyzes the aggregated statistics in this storage 686 to define and/or adjust the load balancing criteria (e.g., weight values), which it then stores in the LB rule storage 688 for propagation to the LB rule storage(s) 440. The publisher 622 retrieves each LB rule that the LB agent 620 stores in the LB rule storage 688, and stores the retrieved rule in the LB rule storage 440 of the load balancer 615 that needs to enforce this rule.
[0059] The LB agent 620 not only propagates LB rule updates based on newly received aggregated statistics, but it also propagates LB rules or updates LB rules based on updates to DCN groups that it receives from the LB controller set 520. The LB agent 620 stores each DCN group's members that it receives from the LB controller set 520 in the group data storage 684. When a DCN is added or removed from a DCN group, the LB agent 620 stores this update in the group storage 684, and then formulates updates to the LB rules to add or remove the destination address of this DCN to or from the LB rules that should include or already include this address. Again, the LB agent 620 stores such updated rules in the rule data storage 688, from where the publisher propagates them to the LB rule storage(s) 440 of the load balancers that need to enforce these rules.
[0060] When a DCN is added to a DCN group, the updated LB rules cause the load balancers to direct some of the DCN-group data messages to the added DCN. Alternatively, when a DCN is removed from a DCN group, the updated LB rules cause the load balancers to re-direct data messages that would go to the removed DCN, to other DCNs in the group. However, even after a DCN is intentionally designated for removal from a DCN group, a load balancer in some examples may continue to send data messages (e.g., for a short duration of time after the removal of the DCN) to the DCN that are part of prior flows that were directed to the DCN. This allows the DCN to be removed gradually and gracefully from the DCN group as the flows that it handles terminate. Some examples also achieve a graceful transition away from a DCN that should be removed from the DCN group by using time values to specify when different LB criteria for the same LB rule should be used. Some examples also use such time values to gracefully add a new DCN to a DCN group.
[0061] In some examples, the LB agent 620 stores in the policy storage 682, LB policies that direct the operation of the LB agent in response to newly provisioned DCN VMs and their associated load balancers, and/or in response to updated global statistics and/or adjusted DCN group membership. The policies in the policy storage 682 in some examples are supplied by the LB controller set 520.
[0062] Figure 9 illustrates a process 900 that the load balancer 615 performs in some examples. As shown, the process 900 starts when the load balancer receives (at 905) a data message from its corresponding SFE port 635. This port relays this message when it receives the data message from its VM. In some examples, the port relays the data message by passing to the load balancer a reference (e.g., a handle that identifies a location in memory that stores the data message) to the data message.
[0063] Next, the process determines (at 910) whether the received data message's destination address is a virtual address (e.g., the VIP address) of a DCN group that the load balancer has to balance its input. To make this determination, the process 900 checks a table in the LB rule data storage 440 that stores the virtual addresses of the DCN groups that the process load balances.
[0064] When the process determines (at 910) that the data message is not directed to a load balanced virtual address, the process sends (at 915) the message along the message's datapath without performing any destination address translation on the message. This operation (at 915) entails informing the SFE port 635 that called it, that the process has completed processing the VM data message. The SFE port 635 can then handoff the VM data message to the SFE 610 or can call another I/O chain operator to perform another operation on the VM data message. After 915, the process ends.
[0065] On the other hand, when the process determines (at 910) that the data message is directed to a load balanced virtual address, the process determines (at 920) whether the connection state cache 690 stores a record that identifies the DCN to which the data message should be routed. As mentioned above, each time a load balancer uses a LB rule to direct a new data message flow a DCN of a DCN group, the load balancer in some examples creates a record in the connection state cache 690 to store the physical IP address of the DCN, so that when the load balancer receives another data message within the same flow (i.e., with the same message-attribute set), it can route it to the same DCN that it used for previous data message in the same flow.
[0066] Also, as mentioned above, the connection-state cache 690 in some examples stores each flow's record based on hashed address values that are hashed versions of the flow identifying attributes of the data message header values. This addressing scheme allows the load balancer to quickly search the cache 690. Hence, before searching the rule data store 440, the load balancer first generates a hash value from the message-attribute set of the received data message (e.g., a hash of the message's five tuples) to identify one or more memory locations in the cache 690, and then uses this hash value to examine the memory location(s) to determine whether the cache stores a connection-flow record with a matching set of attributes as the received VM data message.
[0067] When the process 900 identifies (at 920) a record for the received data message's flow in the cache 690, the process (at 925) then replaces the message's destination address (i.e., the virtual group address, such as the VIP address) with the DCN destination address (e.g., with the DCN IP address) that is stored in the record in the cache 690. At 925, the process sends the address-translated data message along its datapath. In some examples, this operation entails returning a communication to the SFE port 635 (that called the load balancer to initiate the process 900) to let the port know that the load balancer is done with its processing of the VM data message. The SFE port 635 can then handoff the data message to the SFE 610 or can call another I/O chain operator to perform another operation on the data message. At 925, the process 900 also updates in some examples the statistics that it maintains in STAT storage 445 for the DCN to which the message was addressed by the process 900. This update reflects the transmission of a new data message to this DCN. After 925, the process 900 ends.
[0068] When the process 900 determines (at 920) that the connection cache 690 does not store a record for the received data message's flow, the process 900 searches (at 930) the LB rule data store 440 to identify an LB rule for the data message received at 905. To identify the LB rule in the data store 440, the process in some examples compares a set of attributes of the received data message with the data-message identifying tuples (e.g., tuples 705 of Figure 7 ) of the rules to identify a rule that has a tuple set that matches the message's attribute set. In some examples, the process uses different message-attribute sets to perform this comparison operation. For instance, in some examples, the message attribute set includes just the destination IP address of the message (e.g., the VIP of the addressed DCN group), which was used at 910 to determine whether the message is directed to a load balanced DCN group. In other examples, the message attribute set includes other attributes, such as one or more of the other five-tuple identifiers (e.g., one or more of the source IP, source port, destination port, and protocol). In some examples, the message attribute set includes logical network identifiers such as virtual network identifier (VNI), virtual distributed router identifier (VDRI), a logical MAC address, a logical IP address, etc.
[0069] As mentioned above, each LB rule in some examples includes two or more destination addresses (e.g., IP addresses 710), which are the destination addresses (e.g., IP addresses) of the DCNs that are members of the DCN group that has the virtual address (e.g., VIP address) specified in the rule's tuple set 705. When the process identifies an LB rule (at 930), it selects one of the destination addresses (e.g., IP addresses) of the rule to replace the virtual address (e.g., the VIP address) in the message. Also, as mentioned above, each LB rule stores criteria for facilitating the process' selection of one of the destination addresses of the LB rule to replace the message's virtual destination identifier. In some examples, the stored criteria are the weight and/or times values that were described above by reference to Figures 7 and 8 . Accordingly, in some examples, the process 900 selects one of the matching rule's destination addresses based on the selection criteria stored in the rule.
[0070] After changing the destination address of the data message, the process (at 935) sends the data message along its datapath. Again, in some examples, this operation entails returning a communication to the SFE port 635 (that called the load balancer to initiate the process 900) to let the port know that the load balancer is done with its processing of the data message. The SFE port 635 can then handoff the VM data message to the SFE 610 or can call another I/O chain operator to perform another operation on the VM data message.
[0071] After 935, the process transitions to 940, where in the connection cache data store 690, it creates a record to identify the DCN (i.e., to identify the DCN destination identifier) to use to forward data messages that are part of the same flow as the data message received at 905. In some examples, this record is addressed in the cache 690 based on a hash value of the message-attribute set identified at 905. At 940, the process 900 also updates the statistics that it maintains in STAT storage 445 for the DCN to which the message was addressed by the process 900. This update reflects the transmission of a new data message to this DCN. After 940, the process ends.
[0072] Figures 10 and 11 illustrate two processes that the LB agent 620 performs in some examples. Figure 10 illustrates a process 1000 that the LB agent 620 performs each time that it receives updated group memberships and/or global statistics from the LB controller set 520. As shown, the process 1000 starts (at 1005) when it receives from the LB controller set 520 updated statistics for at least one DCN group and/or updated membership to at least one DCN group.
[0073] Next, the process 1000 determines (at 1010) whether the received update includes an update to the membership of at least one DCN group for which the LB agents generates and/or maintains the LB rules. If not, the process transitions to 1020. Otherwise, the process creates and/or updates (at 1015) one or more records in the group membership storage 684 to store the updated group membership that the process received at 1005. From 1015, the process transitions to 1020.
[0074] At 1020, the process 1000 determines whether the received update includes updated statistics for at least one DCN group for which the LB agents generates and/or maintains the LB rules. If not, the process transitions to 1030. Otherwise, the process creates and/or updates (at 1025) one or more records in the global statistics storage 686 to store the updated global statistics that the process received at 1005. From 1025, the process transitions to 1030.
[0075] At 1030, the process initiates a process to analyze the updated records in the group membership storage 684 and/or the global statistics storage 686 to update the group memberships (e.g., the IP addresses) and/or the load balancing criteria (e.g., the weight or time values) of one or more LB rules in the host-level LB rule data storage 688. This analyzing process will be further described below by reference to Figure 11 . From the host-level LB rule data storage 688, the publisher 622 propagates each new or updated LB rule to the LB rule data storage(s) 640 of the individual load balancer(s) 615 (on the same host) that needs to process the new or updated LB rule. In publishing each new or updated LB rule, the publisher 622 does not publish the LB rule to the rule data storage 640 of a load balancer (on the same host) that does not need to process the rule.
[0076] After 1030, the process 1000 ends.
[0077] Figure 11 illustrates a process 1100 that the LB agent 620 performs in some examples to analyze updated records in the group membership storage 684 and/or the global statistics storage 686, in order to update the group memberships (e.g., the IP addresses) and/or the load balancing criteria (e.g., the weight or time values) of one or more LB rules in the host-level LB rule data storage 688. In some examples, the LB agent performs an identical or similar process when the LB agent powers up (e.g., when its host powers up) to configure the LB rules of the load balancers on the host, and when a new SCN VM is instantiated on the host to configure the LB rules of the instantiated VM's load balancer.
[0078] As shown, this process 1100 initially selects (at 1105) a load balancer 615 on the LB agent's host. In some examples, the process selects (at 1105) only load balancers that are affected by one or more of the updated records that resulted in the performance of this process. Next, at 1110, the process selects a virtual address (e.g., a VIP) of a DCN group that the selected load balancer has to load balance. The process then retrieves (at 1115) the stored statistics and group membership data for the DCN group identified by the selected virtual address.
[0079] At 1120, the process analyzes the membership and statistic records retrieved at 1115. Based on this analysis, the process determines whether the group memberships (e.g., the IP addresses) and/or the load balancing criteria (e.g., the weight or time values) of one or more LB rules in the host-level LB rule data storage 688 should be specified and/or modified for the selected load balancer. To perform this analysis, the process 1100 uses one or more policies that are specified in the policy storage 682. If the process determines that it should specify or update the group's membership and/or the load balancing criteria for the selected group, the process performs (at 1120) this specifying or updating, and then stores (at 1125) the specified or updated the group's membership and/or load balancing criteria in one or more LB rules that are stored in the LB data storage 688. As mentioned above, the specified or updated LB rules in the host LB rule storage 688 are distributed by the publisher 622 to the LB data storage 440 of any load balancer that on the same host performs load balancing operations on the input traffic to the selected group. Several examples of updating load balancing criteria and/or group membership will be described below.
[0080] After 1125, the process determines (at 1130) whether it has examined all virtual group identifiers (i.e., all the DCN groups) that the selected load balancer has to load balance. If not, it selects (at 1135) another virtual group identifier (i.e., another DCN group) and returns to 1115 to perform operations 1115-1130 for this newly selected virtual group identifier. Otherwise, the process transitions to 1140, where it determines whether it has examined the updates for all the load balancers (e.g., whether it has examined all the load balancers affected by the new or updated group membership and statistic data) on its host. If so, the process ends. If not, the process selects (at 1145) another load balancer on the same host as the LB agent, and then repeats operations 1110-1140 for this newly selected load balancer.
[0081] Figure 12 illustrates a process 1200 that one or more LB controllers in the LB controller set 520 perform in some examples. As shown, the process 1200 starts (at 1205) when it receives statistics from one or more LB agents and/or receives membership updates for one or more DCN groups. The process 1200 in some examples receives the group membership updates from another process of the LB controller set. For instance, the LB controller set informs the process 1200 that a new DCN VM has been added to or removed from a DCN group when it is informed by the virtualization manager set 525 that a new VM has been created for or terminated from the DCN group.
[0082] After 1205, the process updates (at 1210) (1) the global statistics that the LB controller set 520 maintains based on the statistics received at 1205, and/or (2) the group membership(s) that the LB controller set 520 maintains based on the group updates received at 1205. Next, at 1215, the process determines based on the updated statistics whether it should have one or more SCN or DCN VM specified for or removed from the group. For instance, when the updated statistics causes the aggregated statistics for a DCN group to exceed an overall threshold load value for the DCN group, the process 1200 determines that one or more new DCNs have to be specified (e.g., allotted or instantiated) for the DCN group to reduce the load on DCNs previously specified for the group. Similarly, when the updated statistics causes the aggregated statistics for one or more DCN in the DCN group to exceed a threshold load value, the process 1200 may determine that one or more new DCNs have to be specified (e.g., allotted or instantiated) for the DCN group to reduce the load on the congested DCNs. Conversely, when the updated statistics shows that a DCN in a DCN group is being underutilized or is no longer being used to handle any flows, the process 1200 determines (at 1215) that the DCN has to be removed for the DCN group.
[0083] When the process 1200 determines (at 1215) that it should have one or more SCN or DCN VM added or removed for the group, the process requests (at 1220) the VM managing set 525 to add or remove this VM, and then transitions to 1225. The process also transitions to 1225 when it determines (at 1215) that no SCN or DCN VM needs to be added or removed for the group. At 1225, the process determines whether the time has reached for it to distribute membership update and/or global statistics that the LB controller set maintains to one or more LB agents executing on one or more hosts.
[0084] In some examples, the process 1200 distributes membership updates and/or global statistics on a periodic basis. In other examples, however, the process 1200 distributes membership update and/or global statistics for one or more DCN groups whenever this data is modified. In addition to requesting the addition or removal of a VM from a group, the group membership can change when a VM that is part of a group fails. Such VM failures would have to be relayed to the LB agents so that they can modify the LB rules of their associated load balancers. In some examples, the membership update data that the process 1200 distributes, differentiates a failed DCN from an intentionally removed DCN (i.e., a DCN that has not failed but has been removed from the DCN group). This differentiation allows a load balancer's operation to be differently modified for the failed DCN and the intentionally removed DCN. For the failed DCN, the load balancer stops using the failed DCN, while for an intentionally removed DCN, the load balancer in some examples can continue to use the removed DCN for a duration of time after receiving the membership update (e.g., for new flows up to a particular time, or for previously received flows that are being processed by the DCN). To cause the load balancer to stop using the failed DCN, the connection records that specify the failed DCN in the load balancer's connection storage 690 are removed in some examples.
[0085] When the process determines (at 1225) that it does not need to distribute new data, it transitions to 1230 to determine whether it has received any more statistic and/or membership updates for which it needs to update its records. If so, the process transitions back to 1210 to process the newly received statistic and/or membership updates. If not, the process transitions back to 1225 to determine again whether it should distribute new data to one or more LB agents.
[0086] When the process determines (at 1225) that should distribute membership update(s) and/or global statistics, it distributes (at 1235) this data to one or more LB agents that need to process this data to specify and/or update the load balancing rules that they maintain for their load balancers on their hosts. After 1235, the process determines (at 1240) whether it has received any more statistic and/or membership updates for which it needs to update its records. If not, the process remains at 1240 until it receives statistics and/or membership updates, at which time it transitions back to 1210 to process the newly received statistic and/or membership updates.
[0087] In the examples described above by reference to Figures 10-12 , the LB controller set 520 distributes global statistics to the LB agents, which analyze this data to specify and/or adjust the LB rules that they maintain. In other examples, however, the LB controller set 520 analyzes the global statistics that it gathers, and based on this analysis specifies and/or adjusts LB rules, which it then distributes to the LB agents. In these examples, the LB agents simply store the LB rules or rule modifications that they receive from the LB controller set in the host-level LB rule storage 688 for distribution to the individual LB rule storages 440 of the load balancers 615.
[0088] Figure 13 illustrates a process 1300 that shows the operation of the LB controller set for examples in which the LB controller set analyzes the membership updates and/or global statistics, and in response to this analysis specifies and/or updates LB rules if needed. This process is similar to the process 1200 of Figure 12 , except for the inclusion of operation 1312 and the replacement of operations 1225 and 1235 with the operations 1325 and 1335.
[0089] At 1312, the process 1300 analyzes the membership and statistic records and if needed, specifies and/or updates the group memberships (e.g., the IP addresses) and/or the load balancing criteria (e.g., the weight or time values) of one or more LB rules. This operation is similar to the operation 1120 of the process 1100 of Figure 11 , except when performed by the process 1300 of the LB controller set, the operation 1312 might generate LB rules or rule updates for the load balancers of multiple hosts. From 1312, the process transitions to 1215, which was described above.
[0090] At 1325, the process 1300 determines whether it has to distribute the newly specified and/or updated LB rules. If not, the process transitions to 1230, which was described above. Otherwise, the process transitions to 1335 to distribute the newly specified and/or updated LB rules to the LB agents of the hosts that have load balancers that need to enforce the specified and/or updated LB rules. After 1335, the process transitions to 1240, which was described above.
[0091] Figures 14-16 present several examples that illustrate how some examples dynamically adjust the spreading of traffic by adjusting the load balancing criteria and by adding/removing DCN VMs. Each of these examples is illustrated in terms of multiple operational stages that show several inline load balancers 1400 dynamically adjust how they spread the data traffic from several webserver VMs 1405 to several application server VMs 1410. In these examples, each load balancer 1400 is associated with one webserver 1405, while the application server VMs 1410 are part of one DCN group 1450 that is associated with one virtual address identifier. Also, the load balancers 1400, the web servers 1405, and the application server 1410 execute on one or more hosts. On the hosts, one or more LB agents 620 execute to exchange statistics with the LB controller set 520, in order to allow the load balancing operations to be dynamically updated based on dynamically detected load conditions. For the sake of simplifying these figures, the LB agents 620 are not shown in Figures 14- 16.
[0092] In three operational stages 1401-1403, Figure 14 illustrates an example where the load balancing criteria is adjusted based on dynamically detected load conditions. In this example, each load balancer 1400 uses a weighted round robin scheme to distribute the data messages from its associated webserver 1405. The weight values that control this scheme are adjusted by the LB agent(s) based on global load statistics that are supplied by the LB controller set 520. These statistics specify the load on the application server VMs 1410.
[0093] In the first operational stage 1401 of Figure 14 , each load balancer 1400 evenly distributes the data messages of its webserver VMs 1405 among the application server VMs 1410. This even distribution is depicted in this figure by the designation of 10, 10, 10, 10, and 9 on the lines that start on the load balancer 1400a and terminate on the application servers 1410. These numbers are the numbers of active data flows that the load balancer 1400a is directing to the application servers 1410. As shown, the load balancer 1400a in this stage bases its operation on the weight values 1, 1, 1, 1, and 1. These weight values specify that the load balancer should evenly distribute to the five application servers 1410 the next five new data message flows from the webserver 1405a to the application server group 1450.
[0094] The first stage 1401 also shows the LB controller set 520 receiving local connection statistics from each of the load balancers 1400. These statistics are gathered and relayed by the load balancers' LB agents, which are not shown in Figure 14 . The first stage 1401 also shows an example of one of the provided local connection statistics, which is the local statistics 1420 that the load balancer 1400a provides to the LB controller set 520. This local statistics 1420 show that the load balancer 1400a currently has 10, 10, 10, 10, and 9 active flows that it is directing respectively to the application servers 1410a-1410e of the group 1450.
[0095] In different examples, the load balancers use different techniques to quantify the number of active flows that they are directing to each application server 1410. In some examples, the load balancers time out (i.e., remove) flows that are inactive (i.e., for which they have not received any new data messages) after a particular duration of time. Other examples use other techniques to quantify the number of active flows.
[0096] Instead of specifying the number of active flow to express the data traffic load on the DCNs (i.e., the application servers in this example), other examples use other traffic metrics. For instance, the load balancers 1400 collect the number of data messages (e.g., data packets) that they route to each application server 1410 in some examples. Other examples collect other traffic metrics such as TCP RTT and window size, retransmission, etc. Still other examples collect other load metrics (such as round-trip delay, TCP window size, etc.) that express the load that each load balancer detects to each DCN to which the load balancer directs traffic. In some examples, the LB agents of the load balancers measure these other load metrics (e.g., the round-trip delay or TCP window size), while in other examples, the load balancers measure one or more of these load metrics (e.g., the round-trip delay or TCP window size).
[0097] The second stage 1402 shows the LB controller set 520 distributing global load statistics to the LB agents (not shown) of each of the load balancers 1400. The global load statistics in some examples is an aggregation of the local statistics that the load balancers provide (through the LB agent) to the LB controller set 520. The second stage 1402 shows an example of the global connection statistics, which is the global statistics 1425 that the LB agent of the load balancer 1400a receives from the LB controller set 520. As shown, the global statistics in this example show the following numbers of active connections for the five application servers 1410a-1410e: 131, 135, 101, 100, and 86. These numbers of connection represent the numbers of active flows that all five load balancers 1400 are distributing to the five application servers 1410a-1410e from the five webservers 1405.
[0098] Like the gathered local statistics, the distributed global statistics are different types of traffic and/or load metrics in other examples. In some examples, the distributed global statistics include for each DCN in the DCN group, aggregated message traffic data that expresses the data message traffic load on the DCN. Examples of such load data include the number of data messages (e.g., number of packets) received by the DCN, number of flows processed by the DCN, number of data message bytes received by the DCN, etc. In some examples, the metrics can be normalized to units of time, e.g., per second, per minute, etc. Also, in some examples, the distributed global statistics express the data message load on each DCN in terms of a relative congestion percentage that compares the load of the DCN to the load of other DCNs in the group.
[0099] In some examples, the distributed global statistics include an aggregated round trip delay (e.g., average round trip delay) to each DCN, an aggregated TCP window size value (e.g., average TCP window size) for each DCN, etc. Also, in some examples, the distributed global statistics are partially or completely based on metrics that the LB controller set 520 gathers by interacting directly with the DCNs (e.g. with the application servers 1410). In some examples in which the global statistics are completely based on metrics directly gathered by the LB controller set, the LB controller set does not gather statistics that the load balancers 1400 collect locally.
[0100] The second stage 1402 also shows the adjustment of the weight values that the load balancer 1400a uses to spread new flows to the application servers 1410. These weight values are adjusted by the LB agent(s) 620 based on the received global statistics 1425. The weight values after they are adjusted are 1, 1, 2, 2, 3. These weight values direct the load balancer 1400a to spread in a weighted round-robin approach the next nine new data message flows as follows: 1 to the first application server 1410a, 1 to the second application server 1410b, 2 to the third application server 1410c, 2 to the fourth application server 1410d, and 3 to the fifth application server 1410e. As mentioned above, some examples specify and use time period values in the LB rules in order to allow the load balancers to gracefully transition between different weight value sets to dynamically adjust their load balancing operations.
[0101] The third stage 1403 shows that after this adjustment of the weight values, the relative even distribution of flows by the load balancer 1400a becomes skewed towards the application servers 1410 that are associated with the higher weight values, i.e., the application servers 1410c, 1410d, and 1410e. Specifically, this stage shows that once the weight values are adjusted, the number of flows (from the webservers 1405 to the application servers 1410) goes from 20, 20, 20, 19, and 19, to 23, 23, 26, 26, and 28.
[0102] In the example illustrated in Figure 14 , the load balancing criteria (i.e., the weight values in this example) are adjusted by the LB agent(s) based on global statistics distributed by the LB controller set 520. In other examples, however, the LB controller set adjusts and distributes the load balancing criteria based on statistics that the LB controller set collects from the load balancers and/or from the DCN group(s). In these examples, the load balancers use the load balancing criteria distributed by the LB controller set to perform or adjust their load balancing operations. In some of these examples, the LB controller set also initially defines the LB rules with the initial weight values, and distributes these rules to the load balancers (through the LB agents) for the load balancers to store and use.
[0103] In three operational stages 1501-1503, Figure 15 illustrates an example of adding a DCN to a DCN group to alleviate the traffic load on the DCN group members. This example follows the third stage 1403 of the example of Figure 14 . The first stage 1501 of Figure 15 shows the addition (e.g., the allotment or instantiation) of a sixth application sever 1410f to the application server group 1450. This sixth application server 1410f has been added to the group by the LB controller set 520 directing the VM managing controller set 525 to allot a previously created application server VM to this group, or to instantiate a new application server VM for this group.
[0104] The first stage 1501 also shows the LB controller set 520 providing global statistics 1520 and group update 1522 to the LB agents (not shown) of the load balancers 1400. The global statistics 1520 show that each application server is currently handling about 50K flows, which in this example is assumed to be near the threshold maximum number of flows for each application server. As shown, in this stage, the number of flows from load balancer 1400a to the application servers is 20K, 18K, 21K, 17K, and 19K.
[0105] The group update 1522 informs the load balancers that the sixth application server 1410f has been added to the application server group 1450. In response to this group update, the LB agent (not shown) of the webserver 1405a adjusts the weight values of the LB rule that load balancer 1400a of this webserver enforces. As shown in the first stage 1501, the adjusted weight values are 1, 1, 1, 1, 1, 1000. This weight value set directs the load balancer to assign the next 1005 new data flows from the webserver 1405a to the application servers 1410a-1410f based on a weighted round robin scheme that assigns the next five news flows to the applications servers 1410a-1410e, and then assign the next 1000 flows to the application server 1410f.
[0106] After receiving the group update 1522, the LB rules of the other load balancers of the other webservers 1405 are similarly adjusted by their respective LB agent(s). In response to these adjusted weight values, the load on the sixth application servers 1410f starts to increase, while the load on the first five application servers 1410a-1410e starts to decrease, as shown in the second stage 1502. The second stage 1502 shows the LB controller set providing updated global statistics 1525 to the LB agents (not shown) of the load balancers 1400. The updated global statistics 1525 shows that the load on the five application servers 1410a-1410e has dropped to 40K, 39K, 41K, 38K and 39K, while the load on the sixth application server 1410f has risen to 18K. In this stage, the number of flows from load balancer 1400a to the application servers is now 14K, 12K, 13K, 15K, 16K, and 8K.
[0107] The second stage 1502 also shows that in response to the updated global statistics, the weight values for the load balancer 1400a have been adjusted to be 1, 1, 1, 1, 3. After receiving the global statistics 1525, the weight values of the other load balancers of the other webservers 1405 are also adjusted by their respective LB agent(s). The third stage 1503 then shows that in response to these weight value adjustments, the load across the application servers 1410 has reached 44K, 42K, 43K, 45K, 46K, and 35K, as indicated in the updated global statistics 1535. In this stage, the number of flows from load balancer 1400a to the application servers is now 12K, 12K, 13K, 14K, 13K, and 13K.
[0108] In three operational stages 1601-1603, Figure 16 illustrates an example of removing a DCN from a DCN group when fewer DCNs are needed to handle the load on the DCN group. This example follows the third stage 1503 of the example of Figure 15 . The first stage 1601 of Figure 16 shows the LB controller set 520 providing global statistics 1620 and group update 1622 to the LB agents (not shown) of the load balancers 1400. The global statistics 1620 show that the application servers 1410 respectively handling 22K, 26K, 27K, 28K, 28K, and 26K flows. As shown, in this stage, the number of flows from load balancer 1400a to the application servers is 6K, 7K, 10K, 9K, 10K and 10K.
[0109] The first stage 1601 also shows the LB controller set 520 providing a group update 1622 that informs the load balancers that the first application server 1410a should be removed from the application server group 1450. In response to this group update, the LB agent (not shown) of the webserver 1405a adjusts the weight values of the LB rule that load balancer 1400a of this webserver enforces. As shown in the first stage 1601, the adjusted weight values are 0, 2, 1, 1, 1, 1. This weight value set directs the load balancer to assign the next 6 new data flows from the webserver 1405a to the application servers 1410b-1410f based on a weighted round robin scheme that assigns the next two news flows to the applications server 1410b, and then assign the next four flows individually to each of the four application servers 1410c-1410f.
[0110] After receiving the group update 1622, the LB rules of the other load balancers of the other webservers 1405 are similarly adjusted by their respective LB agent(s). In response to these adjusted weight values, the load on the first application server 1410a starts to decrease, while the load on the other five application servers 1410b-1410f starts to increase, as shown in the second stage 1602. The second stage 1602 shows the LB controller set providing updated global statistics 1625 to the LB agents (not shown) of the load balancers 1400. The updated global statistics 1625 shows that the load on the application server 1410a has dropped down to 12K flows, while the load on the application servers 1410b-1410f has increased to 30K, 32K, 31K, 32K and 30K flows. In this example, the load on the application server 1410a does not immediately fall to zero because this server continues to receive data messages for flows that it has been processing.
[0111] The second stage also shows the number of flows from load balancer 1400a to the application servers to now be 5K, 8K, 9K, 8K, 10K, and 9K. The second stage 1602 further shows that in response to the updated global statistics, the weight values for the load balancer 1400a have been adjusted to be 0, 1, 1, 1, 1, 1. After receiving the global statistics 1625, the weight values of the other load balancers of the other webservers 1405 are also adjusted by their respective LB agent(s).
[0112] The third stage 1603 then shows that in response to these weight value adjustments, the application server 1410a has effectively been removed from the DCN group 1450 as it no longer receives any flows from the load balancers 1400. This stage also shows that the load on the other application servers 1410b-f has reached 40K, 39K, 41K, 38K and 39K flows, as indicated in the updated global statistics 1635. In this stage, the number of flows from load balancer 1400a to the application servers is now 0, 12K, 13K, 14K, 13K, and 13K.
[0113] Examples above show the addition of new DCNs to alleviate the traffic load. In some examples, the load on the DCNs can be adjusted by adding or removing SCN VMs. Also, even though the LB rules in the above-described examples include weight values that facilitate the load balancers dynamic adjustment of the load, one of ordinary skill in the art will realize that in other examples the load balancers use other mechanisms for dynamically adjusting the data traffic load based on dynamically detected load conditions.
[0114] In the above-described examples, the load balancers are described as balancing the data traffic between different layers of data compute end nodes (DCENs), such as webservers, application servers and database servers. However, in some examples, the distributed load balancing architecture can be used to load balance the data traffic to and from middlebox service nodes. In other words, the DCNs in the DCN group in some examples can be middlebox service nodes (such as firewalls, intrusion detectors, WAN optimizers, etc.).
[0115] Also, as illustrated in Figure 17 , the inline load balancers in some examples can be configured to route data messages that are sent to DCENs initially to a set of middlebox service nodes. In this example, inline load balancers 1700 (associated with the webserver VMs 1705) direct the data traffic that web servers 1705 send to application servers 1710, to firewall middlebox VMs 1720. In directing the data messages to the firewalls 1720, the inline load balancers perform load balancing operations that spread the data message load among the firewalls 1720. Once processed by the firewalls 1720, the firewall-filtered data messages are distributed by the inline load balancers 1730 (associated with the firewall VMs 1720) to the application servers 1710. As shown, in this example, the firewall servers are service VMs executing on the same hosts as the webservers and application servers.
[0116] To direct to the firewall VMs the data traffic that is addressed to the application servers' virtual address (e.g., VIP), the load balancers 1700 in some examples (1) perform a virtual address (e.g., a VIP) translation that replaces the application server virtual address with the firewall VM's virtual address, and then (2) spread the received data traffic amongst the firewall VMs based on their load balancing criteria. In some examples, the load balancers 1700 address translation inserts identifiers in the message identifiers (e.g., in the packet header) that allows the firewall VMs 1720 and load balancers 1730 to determine that their received messages are directed to application servers 1710. To make this determination, the load balancers 1730 are configured with rules that enable the load balancers to associate the received data messages with the application servers 1710 in some examples.
[0117] As mentioned above, an inline load balancer 615 of a VM 605 can perform multiple different load balancing operations for multiple different groups of DCNs. This is because the load balancer 615 can apply the load balancing rules of multiple different groups of DCNs. These rules are stored in the load balancing data storage 440, as described above by reference to Figures 6-8 .
[0118] Figure 18 illustrates that one inline load balancer can form multiple different distributed load balancers with multiple different sets of inline load balancers. Figure 18 presents two sets of inline load balancers that distribute the data messages of two different sets 1805 and 1810 of VMs to two different groups of DCNs 1820 and 1825. One VM, VM1, is part of both sets 1805 and 1810 of VMs. Each inline load balancer is analogous to the inline load balancer 615 of Figure 6 .
[0119] As shown in Figure 18 , the inline load balancer 1815 of VM1 enforces load balancing rules 1850 and 1855 that are stored in its load balancing storage 1840. These load balancing rule 1850 and 1855 direct the load balancer 1815 to distribute data messages of VM1 that are directed respectively to DCN groups 1820 and 1825 to the DCNs in these groups. Also, in this example, the inline load balancer 1870 of the virtual machine VM2 enforces a LB rule for distributing data messages for DCN group 1820, while the inline load balancer 1875 of the virtual machine VM3 enforces a LB rule for distributing data messages for DCN group 1825. The LB rules of the inline load balancers 1815 and 1870 of VM1 and VM2 for DCN group 1820 can have identical LB criteria or different LB criteria. Similarly, the LB rules of the inline load balancers 1815 and 1870 of VM1 and VM3 for DCN group 1825 can have identical LB criteria or different LB criteria. These load balancing rules (e.g., rules 1850 and 1855) and their associated load balancing storage (e.g., storage 1840) are analogous to the load balancing rules 700 and 800 and the load balancing storage 440 of Figures 6-8 .
[0120] As shown in Figure 18 , the inline load balancers (e.g., 1815 and 1870 of VM1 and VM2) of VM group 1805 form a distributed load balancer 1880 that distributes the data messages from VMs of group 1805 amongst the DCNs of DCN group 1820. Similarly, as shown, the inline load balancers (e.g., 1815 and 1875 of VM1 and VM3) of VM group 1810 form a distributed load balancer 1890 that distributes the data messages from VMs of group 1810 amongst the DCNs of DCN group 1825.
[0121] Each distributed load balancer 1880 or 1890 is logical construct as it is not one item in the physical world, but rather conceptually represents one set of load balancing operations that a group of associated inline load balancers performs to distribute the data message load on a DCN group. In this distributed approach, each inline load balancer only needs to store the load balancing rules of the distributed load balancer that it implements. In other words, each inline load balancer in this distributed approach only needs to store the load balancing rules of the DCN-group data messages that its associated VM might send out. Also, in this distributed approach, each inline load balancer needs to only maintain in its connection data store (e.g., connection data storage 690) the flow connection states of the data message flows sent by the load balancer's associated VM. Because of all of these reasons, the inline load balancers of some examples are fast and efficient as they maintain small LB rule and connection state data storages that they can search quickly.
[0122] In the example illustrated in Figure 18 , the inline load balancer 1815 is shown to be part of two distributed load balancers 1880 and 1890 by being part of two sets of associated load balancers, one for the VM group 1805 and another for the VM group 1810. In other examples, an inline load balancer can be part of any arbitrary number N of distributed load balancers, when with another set of N inline load balancers it enforces N load balancing rules for data messages that are being directed to N different DCN groups.
[0123] Figure 19 illustrates another example that illustrates that the inline load balancers of some examples can differently translate the virtual addresses of data messages to different groups of DCNs. Specifically, this figure illustrates five inline load balancers 1900 of five webservers 1905 that direct and load balance data messages to a first VIP associated with a first group 1910 of application servers to the application servers 1915 and 1920 of this group 1910, while directing and load balancing data messages to a second VIP associated with a second group 1930 of application servers to the application servers 1935 and 1940 of this group 1930.
[0124] In some examples, the inline load balancers differently direct and load balance data messages that are addressed to the same virtual address. For instance, some examples define priority sub-groups within an addressed DCN group, and load balance different priority data messages to different sub-groups based on their priority. For example, Figure 20 illustrates a set of inline load balancers 2000 that direct webserver data messages to a group 2050 of application servers to either a high-priority sub-group 2040 of application servers or a low priority group 2045 of application servers based on the assessed priority of the data messages.
[0125] In different examples, the load balancers 2000 assess the priority of the data messages from the webservers 2005 differently. For instance, in some examples, the load balancers assess the priority of the data messages based on identity of the sources from which the webserver received the data messages. After assessing the priority of the data messages, the load balancers direct the received data messages to the application server sub-group with the corresponding priority.
[0126] Many of the above-described features and applications are implemented as software processes that are specified as a set of instructions recorded on a computer readable storage medium (also referred to as computer readable medium). When these instructions are executed by one or more processing unit(s) (e.g., one or more processors, cores of processors, or other processing units), they cause the processing unit(s) to perform the actions indicated in the instructions. Examples of computer readable media include, but are not limited to, CD-ROMs, flash drives, RAM chips, hard drives, EPROMs, etc. The computer readable media does not include carrier waves and electronic signals passing wirelessly or over wired connections.
[0127] In this specification, the term "software" is meant to include firmware residing in read-only memory or applications stored in magnetic storage, which can be read into memory for processing by a processor. Also, in some examples, multiple software inventions can be implemented as sub-parts of a larger program while remaining distinct software inventions. In some examples, multiple software inventions can also be implemented as separate programs. Finally, any combination of separate programs that together implement a software invention described here is within the scope of the invention. In some examples, the software programs, when installed to operate on one or more electronic systems, define one or more specific machine implementations that execute and perform the operations of the software programs.
[0128] Figure 21 conceptually illustrates a computer system 2100 with which some examples of the invention are implemented. The computer system 2100 can be used to implement any of the above-described hosts, controllers, and managers. As such, it can be used to execute any of the above described processes. This computer system includes various types of non-transitory machine readable media and interfaces for various other types of machine readable media. Computer system 2100 includes a bus 2105, processing unit(s) 2110, a system memory 2125, a read-only memory 2130, a permanent storage device 2135, input devices 2140, and output devices 2145.
[0129] The bus 2105 collectively represents all system, peripheral, and chipset buses that communicatively connect the numerous internal devices of the computer system 2100. For instance, the bus 2105 communicatively connects the processing unit(s) 2110 with the read-only memory 2130, the system memory 2125, and the permanent storage device 2135.
[0130] From these various memory units, the processing unit(s) 2110 retrieve instructions to execute and data to process in order to execute the processes of the invention. The processing unit(s) may be a single processor or a multi-core processor in different examples. The read-only-memory (ROM) 2130 stores static data and instructions that are needed by the processing unit(s) 2110 and other modules of the computer system. The permanent storage device 2135, on the other hand, is a read-and-write memory device. This device is a non-volatile memory unit that stores instructions and data even when the computer system 2100 is off. Some examples of the invention use a mass-storage device (such as a magnetic or optical disk and its corresponding disk drive) as the permanent storage device 2135.
[0131] Other examples use a removable storage device (such as a floppy disk, flash drive, etc.) as the permanent storage device. Like the permanent storage device 2135, the system memory 2125 is a read-and-write memory device. However, unlike storage device 2135, the system memory is a volatile read-and-write memory, such a random access memory. The system memory stores some of the instructions and data that the processor needs at runtime. In some examples, the invention's processes are stored in the system memory 2125, the permanent storage device 2135, and/or the read-only memory 2130. From these various memory units, the processing unit(s) 2110 retrieve instructions to execute and data to process in order to execute the processes of some examples.
[0132] The bus 2105 also connects to the input and output devices 2140 and 2145. The input devices enable the user to communicate information and select commands to the computer system. The input devices 2140 include alphanumeric keyboards and pointing devices (also called "cursor control devices"). The output devices 2145 display images generated by the computer system. The output devices include printers and display devices, such as cathode ray tubes (CRT) or liquid crystal displays (LCD). Some examples include devices such as a touchscreen that function as both input and output devices.
[0133] Finally, as shown in Figure 21 , bus 2105 also couples computer system 2100 to a network 2165 through a network adapter (not shown). In this manner, the computer can be a part of a network of computers (such as a local area network ("LAN"), a wide area network ("WAN"), or an Intranet, or a network of networks, such as the Internet. Any or all components of computer system 2100 may be used in conjunction with the invention.
[0134] Some examples include electronic components, such as microprocessors, storage and memory that store computer program instructions in a machine-readable or computer-readable medium (alternatively referred to as computer-readable storage media, machine-readable media, or machine-readable storage media). Some examples of such computer-readable media include RAM, ROM, read-only compact discs (CD-ROM), recordable compact discs (CD-R), rewritable compact discs (CD-RW), read-only digital versatile discs (e.g., DVD-ROM, dual-layer DVD-ROM), a variety of recordable/rewritable DVDs (e.g., DVD-RAM, DVD-RW, DVD+RW, etc.), flash memory (e.g., SD cards, mini-SD cards, micro-SD cards, etc.), magnetic and/or solid state hard drives, read-only and recordable Blu-Ray® discs, ultra density optical discs, any other optical or magnetic media, and floppy disks. The computer-readable media may store a computer program that is executable by at least one processing unit and includes sets of instructions for performing various operations. Examples of computer programs or computer code include machine code, such as is produced by a compiler, and files including higher-level code that are executed by a computer, an electronic component, or a microprocessor using an interpreter.
[0135] While the above discussion primarily refers to microprocessor or multi-core processors that execute software, some examples are performed by one or more integrated circuits, such as application specific integrated circuits (ASICs) or field programmable gate arrays (FPGAs). In some examples, such integrated circuits execute instructions that are stored on the circuit itself.
[0136] As used in this specification, the terms "computer", "server", "processor", and "memory" all refer to electronic or other technological devices. These terms exclude people or groups of people. For the purposes of the specification, the terms display or displaying means displaying on an electronic device. As used in this specification, the terms "computer readable medium," "computer readable media," and "machine readable medium" are entirely restricted to tangible, physical objects that store information in a form that is readable by a computer. These terms exclude any wireless signals, wired download signals, and any other ephemeral or transitory signals.
[0137] While the invention has been described with reference to numerous specific details, one of ordinary skill in the art will recognize that the invention can be embodied in other specific forms without departing from the spirit of the invention. For instance, while the load balancing processes were described above by reference to several host architecture, one of ordinary skill in the art will realize that these processes could be implemented in a variety of different architectures that load balance messages at variety of different locations along their egress path out of the host. For instance, in some examples, the load balancing processes are implemented in the PNIC of the host. In other words, the PNIC of the host in some examples examines the VM messages to determine whether it should load balance them before sending them out of the host or sending them to their destination GVMs.
[0138] In many of the above-described examples, the virtual addresses are VIPs, which the load balancers replace by physical IP addresses of the DCN VMs. However, one of ordinary skill in the art will realize that, in other examples, the virtual addresses are different types of addresses and the load balancers perform other address translation operations. For example, in some examples, the load balancer translates a virtual port address to a physical port address (i.e., performs L4 address translation operations), instead of or in conjunction with performing the IP network address translation (to replace the VIP with a physical IP address). In still other examples, the load balancer directs a data message to a DCN in a DCN group through MAC redirection operation, which replaced one MAC address with the MAC address of the DCN that should received the data messages. In some examples, the DCNs are connected to one distributed logical switch that logically spans multiple hosts, and the MAC redirection directs a data message that is addressed to one port of the logical switch to another port of the logical switch.
[0139] In many of the above-described examples, an LB agent adjusts the load balancing criteria for the load balancers that execute on its host based on the data distributed by the controller set. One of ordinary skill will realize that in other examples, the load balancers themselves adjust their load balancing criteria based on the data distributed by the controller set.
[0140] This specification refers throughout to computational and network environments that include virtual machines (VMs). However, virtual machines are merely one example of a compute node, also referred to as addressable nodes. Some examples of the invention are equally applicable to any computing node that utilizes a port abstraction defined on a host computing device to allow multiple programs that execute on the host to share common resources on the host. As such, the compute nodes in some examples may include non-virtualized physical hosts, virtual machines, containers that run on top of a host operating system without the need for a hypervisor or separate operating system, and hypervisor kernel network interface modules.
[0141] VMs, in some examples, operate with their own guest operating systems on a host using resources of the host virtualized by virtualization software (e.g., a hypervisor, virtual machine monitor, etc.). The tenant (i.e., the owner of the VM) can choose which applications to operate on top of the guest operating system. Some containers, on the other hand, are constructs that run on top of a host operating system without the need for a hypervisor or separate guest operating system. In some examples, the host operating system uses name spaces to isolate the containers from each other and therefore provides operating-system level segregation of the different groups of applications that operate within different containers. This segregation is akin to the VM segregation that is offered in hypervisor-virtualized environments that virtualize system hardware, and thus can be viewed as a form of virtualization that isolates different groups of applications that operate in different containers. Such containers are more lightweight than VMs.
[0142] Hypervisor kernel network interface modules, in some examples, is a non-VM DCN that includes a network stack with a hypervisor kernel network interface and receive/transmit threads. One example of a hypervisor kernel network interface module is the vmknic module that is part of the ESXi™ hypervisor of VMware, Inc.
[0143] One of ordinary skill in the art will recognize that while the specification refers to VMs, the examples given could be any type of DCNs, including physical hosts, VMs, non-VM containers, and hypervisor kernel network interface modules. In fact, the example networks could include combinations of different types of DCNs in some examples.
[0144] A number of the figures (e.g., Figures 9-13 ) conceptually illustrate processes. The specific operations of these processes may not be performed in the exact order shown and described. The specific operations may not be performed in one continuous series of operations, and different specific operations may be performed in different examples. Furthermore, the process could be implemented using several sub-processes, or as part of a larger macro process. In view of the foregoing, one of ordinary skill in the art would understand that the invention is not to be limited by the foregoing illustrative details, but rather is to be defined by the appended claims.
[0145] In one example there has been described a non-transitory machine readable medium storing a program for load balancing data messages that are sent by a source compute node (SCN) to a group of destination compute nodes (DCNs), the program comprising sets of instructions for: intercepting a data message sent by the SCN and determining whether the data message is addressed to the DCN group; and when the data message is addressed to the DCN group, identifying a DCN in the DCN group that should receive the data message, and sending the data message to the identified DCN; wherein the program and the SCN execute on the host computing device.
[0146] In one example, the set of instructions for sending the data message to the identified DCN comprises a set of instructions for changing a destination address in the data message from an address of the identified DCN group to an address of the identified DCN.
[0147] In one example, the addresses include Internet Protocol (IP) addresses.
[0148] In one example, the addresses include media access control (MAC) addresses.
[0149] In one example, the addresses include Internet Protocol (IP) port addresses.
[0150] In one example, the set of instructions for sending the data message to the identified DCN further comprises a set of instructions for supplying the data message, after the destination address has been changed, to a software forwarding element (SFE) that executes on the host computing device, in order to have the SFE forward the data message to the identified DCN.
[0151] In one example, the SFE forwards the data message to the identified DCN through a set of intermediate forwarding elements that operate outside of the host computing device.
[0152] In one example, the identified DCN executes on the host computing device, and the SFE forwards the data message directly to the identified DCN.
[0153] In one example, the program further comprising a set of instructions for forwarding the data message to a DCN that is not in the DCN group when the data message is not addressed to the DCN group.
[0154] In one example, the set of instructions for forwarding the data message to a DCN that is not in the DCN group comprises a set of instructions for supplying the data message to a software forwarding element (SFE) that executes on the host computing device, in order to have the SFE forward the data message to the DCN that is not in the DCN group.
[0155] In one example, the program is a load balancer that is deployed on the host computing device in the egress datapath of the SCN.
[0156] In one example, the SCN's egress datapath comprises a datapath along which the data messages that are transmitted by the SCN are sent out of the host computing device.
[0157] In one example, the SCN is not configured to send the data messages to the load balancer, the SCN does not address the data messages to the load balancer, the data messages sent from the SCN to the DCN group are load balanced among the DCNs in the group as the data messages specify the DCN group address.
[0158] In one example, the DCN group is a first DCN group, the program further comprising sets of instructions for: determining whether the data message is addressed to a second DCN group; and when the data message is addressed to the second DCN group, identifying a DCN in the second DCN group that should receive the data message, and sending the data message to the identified DCN in the second DCN group.
[0159] In one example, DCNs include data compute end nodes or middle box service nodes.
[0160] In one example, the set of controllers includes more than one controller.
[0161] In one example, the set of controllers includes only one controller.
[0162] In one example, identifying the DCN comprises selecting the DCN from the group of DCNs based on a set of load balancing criteria that is specified in order to spread data messages that are sent to the DCN group among the DCNs in the group.
[0163] In one example, the set of load balancing criteria is specified to evenly spread data messages among the DCNs in the group.
[0164] In one example there has been described a method for load balancing data messages that are sent to a group of destination compute nodes (DCNs), the method comprising, on a host computing device, intercepting data messages that are addressed to the DCN group as the data messages traverse along an egress datapath of the host computing device; for each intercepted data message, identifying one DCN in the DCN group that should receive the intercepted data message; sending each intercepted data message to the DCN that is identified for that intercepted data message.
[0165] In one example, the DCNs for the different data messages are identified based on a set of load balancing criteria that distributes the data messages across the DCNs of the DCN group.
[0166] In one example, the DCN group is a first DCN group, wherein the method further comprises: on the host computing device, intercepting data messages that are addressed to a second DCN group; for each intercepted data message addressed to the second DCN group, identifying one DCN in the second DCN group that should receive the intercepted data message; sending each intercepted data message that was addressed to the second DCN group to the DCN in the second group that is identified for that intercepted data message.
[0167] In one example there has been described a distributed load balancing (DLB) system comprising: a plurality of host computing devices on which a plurality of source compute nodes (SCNs) execute; at least one load balancer executing on each host computing device, each load balancer for intercepting data messages sent by a set of SCN on the load balancer's host computing device, and distributing the data messages that are addressed to a destination compute node (DCN) group to the DCNs in the group; and a set of controllers for configuring the load balancers on the host computing device.
[0168] In one example, the set of controllers includes at least controller for deploying at least one SCN on the host computing devices.
[0169] In one example, the set of controllers configures the load balancers by distributing data regarding data message traffic received at different DCNs in the DCN group; and the distributed message traffic data comprises statistic data that the controller set gathers from the load balancers.
[0170] In one example the DLB system further comprises a load balancing agent executing on each host computing device to compute load balancing criteria for at least one load balancer executing on the host based on the distributed message traffic data, wherein each load balancer distributes the data messages among the DCNs in the DCN group based on the load balancing criteria.
[0171] In one example, the set of controllers configures the load balancers by distributing load balancing criteria to the load balancers.
[0172] In one example, the set of controllers further configures the load balancers by collecting message traffic data from the load balancers, adjusting the load balancing criteria, and distributing adjusted load balancing criteria to the load balancers.
[0173] In one example, the message traffic data from a load balancer comprises data regarding the number of data messages that the load balancer forwarded to each DCN in the DCN group.
[0174] In one example, the message traffic data from a load balancer comprises data regarding the number of data message flows that the load balancer forwarded to each DCN in the DCN group.
[0175] In one example, each load balancer spreads the data messages among the DCNs based on a set of load balancing criteria, and the set of controllers configures the load balancers by collecting message traffic data from the load balancers, adjusting the load balancing criteria, and distributing adjusted load balancing criteria to the load balancers.
[0176] In one example, the distributed load balancing criteria specifies numerical values that affect how the load balancer spreads the data messages among the DCN.
[0177] In one example, the numerical values include one weight value for each DCN, wherein based on the weight values, the load balancer performs a weighted round robin selection of the DCNs for new data message flows.
[0178] In one example, the set of controllers configures the load balancers by distributing addresses of the DCNs that are part of the DCN group.
[0179] In one example, the set of controllers configures the load balancers by (i) distributing a list of the DCNs that are part of the DCN group, and (2) when a DCN is to be added or removed from the DCN group, distributing an updated list that adds or removes the added or removed DCN from the DCN group.
[0180] In one example, each load balancer spreads the data messages among the DCNs based on a set of load balancing criteria, the set of controllers is further for collecting message traffic data from the load balancers, generating aggregated message traffic data from the collected message traffic data, and distributing the aggregated message traffic data to the load balancers; based on the distributed aggregated message traffic data, each load balancer adjusts the set of load balancing criteria that the load balancer uses to spread the data messages among the DCNs.
[0181] In one example, the load balancing criteria specifies numerical values that control how the load balancer spreads the data messages among the DCN.
[0182] In one example, the set of controllers includes more than one controller.
[0183] In one example, the set of controllers includes only one controller.
[0184] In one example, each load balancer is for processing data messages from one SCN on one host computing device, the controller set is further for deploying the SCNs on the host computing device, and the controller set configures each load balancer as part of the deployment of the load balancer's SCN.
[0185] In one example, each load balancer spreads the data messages among the DCNs by changing a destination address in each data message from an address of the identified DCN group to an address of one of the DCN; and wherein each address is one of an Internet Protocol (IP) address, a media access control (MAC) address, and an Internet Protocol (IP) port address.
[0186] In one example, after changing the destination address of a data message, the load balancer supplies the data message to a software forwarding element (SFE) that executes on the host computing device with the load balancer, in order to have the SFE forward the data message to the identified DCN.
[0187] In one example, the SFE forwards the data message to the identified DCN either (i) through a set of intermediate forwarding elements that operate outside of the host computing device when the DCN is outside of the SFE's host computing device, or (ii) directly to the DCN when the DCN executes on the SFE's host computing device.
[0188] In one example, the SCNs are not configured to send the data messages to the load balancers, and thereby do not address the data messages to the load balancers.
[0189] In one example, the DCN group is a first DCN group, and at least one load balancer is further for intercepting, on the load balancer's host computing device, data messages that are sent by at least one SCN to a second DCN group and spreading the data messages among the DCNs of the second group.
[0190] In one example, the DCNs are data compute end nodes or middle box service nodes, and the SCNs are virtual machines.
[0191] In one example, each load balancer is for processing data messages from a set of one or more SCNs on one host computing device; and each load balancer enforces the load balancing rules needed to spread the data traffic from the load balancer's associated SCN set.
[0192] In one example there has been described a method for managing load balancers that load balance data messages that source compute nodes send to at least one group of destination compute nodes (DCNs), the method comprising: collecting message traffic data relating to data messages that the load balancers distribute to the DCNs of the DCN group; generating aggregated message traffic data from the collected message traffic data; based on the aggregated message traffic data, distributing adjustment data to at least a subset of the load balancers to cause the subset of load balancers to adjust how they distribute new data message flows to the DCNs of the DCN group.
[0193] In one example, the distributed adjustment data comprises aggregated message traffic data that expresses the data message traffic load on each DCN in the DCN group.
[0194] In one example, the aggregated message traffic data expresses the number of flows directed to each DCN.
[0195] In one example, the aggregated message traffic data expresses the number of packets directed to each DCN.
[0196] In one example, the aggregated message traffic data expresses the number of bytes directed to each DCN.
[0197] In one example, the aggregated message traffic data expresses a relative congestion percentage of each DCN.
[0198] In one example, each load balancer spreads the data messages among the DCNs based on a set of load balancing criteria, each load balancer that receives the aggregated message traffic data, analyzes the received aggregated message traffic data and adjusts the load balancer's set of load balancing criteria if the load balancer determines that such an adjustment is needed based on the analysis of the aggregated message traffic data.
[0199] In one example the method includes providing to each load balancer an initial set of load balancing criteria as part of an initial configuration of the load balancer.
[0200] In one example, the load balancing criteria specifies numerical values that affect how the load balancer spreads the data messages among the DCN.
[0201] In one example, the numerical values include one weight value for each DCN, wherein based on the weight values, the load balancer performs a weighted round robin selection of the DCNs for new data message flows.
[0202] In one example, each load balancer starts with an initial set of load balancing criteria.
[0203] In one example, each load balancer spreads the data messages among the DCNs based on a set of load balancing criteria, wherein the distributed adjustment data comprises an adjusted set of load balancing criteria for each load balancer in the subset of load balancers.
[0204] In one example, each load balancer in the subset of load balancers receives the same adjusted set of load balancing criteria.
[0205] In one example, at least two different load balancers in the subset of load balancers receive different adjusted sets of load balancing criteria.
[0206] In one example, the distributed load balancing criteria set specifies a weight value for each DCN, wherein based on the weight values, the load balancer performs a weighted round robin selection of the DCNs for new data message flows.
[0207] In one example, the distributed adjustment data comprises updated membership data regarding DCNs that are part of the DCN group.
[0208] In one example, the updated membership data identifies at least one new DCN that has been added to the DCN group, and after receiving the updated membership data, the subset of load balancers starts to direct a portion of new data message flows to the newly added DCN.
[0209] In one example the method includes deploying a virtual machine (VM) on one of host computing devices to act as the newly added DCN before sending the updated membership data to the subset of load balancers.
[0210] In one example the method includes, before deploying the VM, determining that the load on the DCNs in the DCN group has passed a particular threshold that requires another DCN to be added to the DCN group.
[0211] In one example, the updated membership data identifies at least one DCN that has been removed from the DCN group due to an operational failure at the DCN, and after receiving the updated membership data, each load balancer in the subset redirects data message flows that were previously being directed to the removed DCN to other DCNs in the group.
[0212] In one example, the updated membership data identifies at least one DCN that has been removed from the DCN group, and after receiving the updated membership data, each load balancer in the subset does not direct new data message flows to the removed DCN.
[0213] In one example, the updated membership data identifies at least one DCN that has been removed from the DCN group, and after receiving the updated membership data, each load balancer in the subset stops directing new data message flows to the removed DCN at a particular time after receiving the updated membership data.
[0214] In one example, the subset of load balancers comprises each load balancer that receives data messages for the DCN group and has to spread the received data messages amongst the DCNs in the DCN group.
[0215] In one example, the subset of load balancers comprises some but not all of the load balancers that receive data messages for the DCN group and have to spread the received data messages amongst the DCNs in the DCN group, the subset of load balancers including only the load balancers that need to adjust their load balancing operations in view of the aggregated message traffic data.
[0216] In one example, the source compute nodes (SCNs) and the load balancers execute on a set of host computing devices, each load balancer is a load balancer for one SCN that executes on the same host computing device as the load balancer, each load balancer intercepts each data message from its associated SCN, determines whether the data message is addressed to the DCN group, and if so, directs the data message to one of the DCNs in the DCN group.
[0217] In one example, each SCN's load balancer directs at least two different data messages that are part of two different data message flows to two different DCNs of the DCN group.
[0218] In one example, after processing each data message, each load balancer supplies the data message to a software forwarding element (SFE) that executes on the load balancer's host computing device, for the SFE to forward the data message to an addressed destination.
[0219] In one example there has been described a non-transitory machine readable medium storing a program for adjusting load balancing operations of a particular load balancer that load balances data messages sent by at least one source compute node (SCN) to a group of destination compute nodes (DCNs), the program comprising sets of instructions for: sending, to a set of controllers, message traffic data relating to data messages that the particular load balancer distributes among the DCNs of the DCN group; receiving, from the set of controllers, load balancing adjustment data that is based on message traffic data collected by the set of controllers from a plurality of load balancers; and adjusting, based on the received adjustment data, how the particular load balancer distributes new data message flows among the DCNs of the DCN group.
[0220] In one example, the adjustment data comprises aggregated message traffic data that expresses the data message traffic load on each DCN in the DCN group.
[0221] In one example, the aggregated message traffic data expresses the number of flows directed to each DCN.
[0222] In one example, the aggregated message traffic data expresses the number of packets directed to each DCN.
[0223] In one example, the aggregated message traffic data expresses the number of bytes directed to each DCN.
[0224] In one example, the aggregated message traffic data expresses a relative congestion percentage of each DCN.
[0225] In one example, the program further comprising sets of instructions for: spreading, from the particular load balancer, the data messages among the DCNs based on a set of load balancing criteria; and analyzing, after receiving the aggregated message traffic data at the particular load balancer, the received aggregated message traffic data and adjusting the particular load balancer's set of load balancing criteria when an adjustment is needed based on the analysis of the aggregated message traffic data.
[0226] In one example, the program further comprising a set of instructions for receiving at the particular load balancer an initial set of load balancing criteria as part of an initial configuration of the load balancer.
[0227] In one example, the load balancing criteria specifies numerical values that affect how the load balancer spreads the data messages among the DCN.
[0228] In one example, the numerical values include one weight value for each DCN, wherein based on the weight values, the load balancer performs a weighted round robin selection of the DCNs for new data message flows.
[0229] In one example, the program further comprising a set of instruction for using an initial set of load balancing criteria to spread the data messages among the DCNs of the DCN group, before receiving the adjustment data.
[0230] In one example, the program further comprising a set of instructions for spreading, from the particular load balancer, the data messages among the DCNs based on a set of load balancing criteria, wherein the adjustment data comprises an adjusted set of load balancing criteria for adjusting the spreading of the data messages to the DCNs of the DCN group.
[0231] In one example, the received load balancing criteria set specifies a weight value for each DCN, and based on the weight values, the load balancer performs a weighted round robin selection of the DCNs for new data message flows.
[0232] In one example, the adjustment data identifies at least one new DCN that has been added to the DCN group, and the program further comprising a set of instructions for directing, after receiving the updated membership data, a portion of new data message flows to the newly added DCN.
[0233] In one example, the adjustment data identifies at least one DCN that has been removed from the DCN group due to an operational failure at the DCN, and the program further comprising a set of instructions for redirecting, after receiving the updated membership data, message data flows that were previously being directed to the removed DCN to other DCNs in the group.
[0234] In one example, the adjustment data identifies at least one DCN that has been removed from the DCN group, and the sets of instructions further comprises a set of instructions for directing, after receiving the updated membership data, all new data message flows to a subset of DCNs in the DCN group that does not include the removed DCN.
[0235] In one example, the adjustment data identifies at least one DCN that has been removed from the DCN group, and the program further comprising a set of instructions for directing, after a particular time subsequent to receiving the updated membership data, all new data message flows to a subset of DCNs in the DCN group that does not include the removed DCN.
[0236] In one example, the set of instructions for directing all new data message flows to the DCN subset comprises a set of instructions for switching from one set of load balancing criteria of a load balancing rule to another set of load balancing criteria after the particular time.
[0237] In one example, the particular load balancer is a load balancer of a source compute node (SCN) that executes on the a host computing device with the particular load balancer, the program further comprising sets of instructions for: intercepting each data message from the SCN; determining whether the data message is addressed to the DCN group; and directing the data message to one of the DCNs in the DCN group when the data message is addressed to the DCN group.
[0238] In one example, the set of instructions for directing the data message comprises a set of instructions for directing at least two different data messages that are part of two different data message flows to two different DCNs of the DCN group.
[0239] In one example, the set of instructions for directing the data message comprises a set of instructions for supplying the data message to a software forwarding element (SFE) that executes on the load balancer's host computing device, for the SFE to forward the data message to an addressed destination.

权利要求:
Claims (15)
[0001] A method for performing load balancing on a plurality of host computers (505, 510, 515) executing a plurality of source machines (605) sending data message flows to a group of destination machines (1820) associated with a common network address, the method comprising:
configuring a plurality of load balancers (615) on the plurality of host computers (505, 510, 515) to implement a distributed load balancer (1880) to distribute data message flows among the destination machines in the group (1820), wherein at least one host computer (505, 510, 515) has at least two load balancers configured to implement the distributed load balancer;
configuring, for each particular source machine (605) on each particular host computer (600), a module (625, 635) that resides along a datapath of the particular source machine and that executes on the particular host computer, to direct a data message to a particular load balancer associated with the particular source machine for the particular load balancer to determine whether the data message is addressed to the common network address and if so to select a destination machine in the group to receive the data message.
[0002] The method of claim 1, wherein configuring the plurality of load balancers comprises configuring one load balancer (615) for each source machine such that when multiple source machines execute on one host computer, multiple load balancers are configured on that host computer with each source machine having its own load balancer.
[0003] The method of claim 1, wherein the load balancers are modules provided by hypervisors executing on the host computers.
[0004] The method of claim 1, wherein the configured modules are ports (635) of software forwarding elements (610) executing on the host computers.
[0005] The method of claim 1, wherein the machines are virtual machines (VMs) (605) and the configured modules are virtual network interface cards (VNICs) (625) of the VMs.
[0006] The method of claim 1, wherein configuring the load balancers comprises providing load balancing criteria to control how the load balancers distribute the data message flows among the destination machines in the group.
[0007] The method of claim 6, wherein the load balancing criteria comprise weight values (715) that affect how the load balancers distribute the data messages flows among the destination machines in the group.
[0008] The method of claim 7, wherein the weight values include one weight value for each destination machine in the group, wherein based on the weight values, each load balancer performs a weighted round robin selection of the destination machines for the data message flows.
[0009] The method of claim 6 further comprising:
configuring the load balancers to provide, to a set of one or more controllers (520), data message traffic statistics regarding the data message flows distributed among the destination machines in the group,
the controller set using the traffic statistics to recompute the load balancing criteria and to distribute the recomputed load balancing criteria to the load balancers.
[0010] The method of claim 6 further comprising:
configuring the load balancers to provide, to a set of one or more controllers (520), data message traffic statistics regarding the data message flows distributed among the destination machines in the group,
the controller set using the traffic statistics to add or remove destination machines to or from the group.
[0011] The method of claim 10, wherein data message traffic statistics comprise at least one of a number of data messages forwarded to each destination machine and a number of data message flows forwarded to each destination machine.
[0012] The method of claim 1, wherein configuring the load balancers comprises providing the load balancers with the network addresses (710) of the destination machines in the group.
[0013] The method of claim 12, whereinthe common network address is a virtual Internet Protocol (IP) (705) address associated with the group,configuring the load balancers comprises providing, to the load balancers, the virtual IP (VIP) address along with a plurality of destination IP addresses (710) with each destination IP address corresponding to a destination machine in the group,each load balancer changing the VIP address in a data message addressed to the group to a destination IP address of one of the destination machines in the group.
[0014] A machine readable medium storing a program which when executed by at least one of the processing units implements the method according to any one of claims 1-13.
[0015] An electronic device comprising:
a set of processing units; and
a machine readable medium storing a program which when executed by at least one of the processing units implements the method according to any one of claims 1-13.

类似技术:

公开号 | 公开日 | 专利标题

US10798023B2|2020-10-06|Edge datapath using user-kernel transports

US10659373B2|2020-05-19|Processing packets according to hierarchy of flow entry storages

US10693763B2|2020-06-23|Asymmetric connection with external networks

US10652129B2|2020-05-12|Specializing virtual network device processing to avoid interrupt processing for high packet rate applications

US10911360B2|2021-02-02|Anycast edge service gateways

US10182035B2|2019-01-15|Implementing logical network security on a hardware switch

US10135714B2|2018-11-20|Servers, switches, and systems with switching module implementing a distributed network operating system

US10404622B2|2019-09-03|Servers, switches, and systems with virtual interface to external network connecting hardware and integrated networking driver

US20190342346A1|2019-11-07|Creating and using remote device management attribute rule data store

US10164881B2|2018-12-25|Route advertisement by managed gateways

US10949248B2|2021-03-16|Load balancing and destination network address translation middleboxes

US9485143B1|2016-11-01|Redundancy of network services in restricted networks

US10333849B2|2019-06-25|Automatic configuration of logical routers on edge nodes

US9680714B2|2017-06-13|Methods, systems, and fabrics implementing a distributed network operating system

US9882776B2|2018-01-30|Methods and apparatus for configuring a virtual network switch

US10698714B2|2020-06-30|Application/context-based management of virtual networks using customizable workflows

US20200119951A1|2020-04-16|Scalable tenant networks

US10802857B2|2020-10-13|Collecting and processing contextual attributes on a host

US10609160B2|2020-03-31|Performing context-rich attribute-based services on a host

US9742697B2|2017-08-22|Integrated server with switching capabilities and network operating system

US20190140984A1|2019-05-09|Traffic and load aware dynamic queue management

EP3410639B1|2021-03-31|Link selection for communication with a service function cluster

US9225597B2|2015-12-29|Managed gateways peering with external router to attract ingress packets

US20200293180A1|2020-09-17|Stage upgrade of image versions on devices in a cluster

US20190238393A1|2019-08-01|Preventing concurrent distribution of network data to a hardware switch by multiple controllers

同族专利:

公开号 | 公开日

US10341233B2|2019-07-02|

US20160094642A1|2016-03-31|

WO2016053373A1|2016-04-07|

US10257095B2|2019-04-09|

US10135737B2|2018-11-20|

EP3201761B1|2020-05-06|

US10320679B2|2019-06-11|

US20160094452A1|2016-03-31|

US20160094643A1|2016-03-31|

US20160094631A1|2016-03-31|

US9774537B2|2017-09-26|

CN107077340A|2017-08-18|

US20160094451A1|2016-03-31|

EP3201761A1|2017-08-09|

US20190288947A1|2019-09-19|

EP3201761A4|2018-05-23|

引用文献:

公开号 | 申请日 | 公开日 | 申请人 | 专利标题

法律状态:
2020-08-14| PUAI| Public reference made under article 153(3) epc to a published international application that has entered the european phase|Free format text: ORIGINAL CODE: 0009012 |

2020-08-14| STAA| Information on the status of an ep patent application or granted ep patent|Free format text: STATUS: THE APPLICATION HAS BEEN PUBLISHED |

2020-09-16| AC| Divisional application: reference to earlier application|Ref document number: 3201761 Country of ref document: EP Kind code of ref document: P |

2020-09-16| AK| Designated contracting states|Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |

2021-03-19| STAA| Information on the status of an ep patent application or granted ep patent|Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |

2021-04-21| RBV| Designated contracting states (corrected)|Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |

2021-04-21| 17P| Request for examination filed|Effective date: 20210316 |

2022-02-06| GRAP| Despatch of communication of intention to grant a patent|Free format text: ORIGINAL CODE: EPIDOSNIGR1 |

2022-02-06| STAA| Information on the status of an ep patent application or granted ep patent|Free format text: STATUS: GRANT OF PATENT IS INTENDED |

2022-02-23| RIC1| Information provided on ipc code assigned before grant|Ipc: G06F9/455 20180101ALI20220118BHEP Ipc: H04L 67/102920220101ALI20220118BHEP Ipc: H04L 67/102520220101ALI20220118BHEP Ipc: H04L 67/101720220101ALI20220118BHEP Ipc: H04L 101/622 20220101ALI20220118BHEP Ipc: H04L 61/252120220101ALI20220118BHEP Ipc: H04L 61/506920220101ALI20220118BHEP Ipc: H04L 47/125 20220101ALI20220118BHEP Ipc: H04L 45/24 20220101ALI20220118BHEP Ipc: H04L 67/100120220101AFI20220118BHEP |

优先权:

申请号 | 申请日 | 专利标题

[返回顶部]